LEAST SQUARE MULTIPLE CLASSIFICATION FOR
UNEQUAL SUBGROUPS¹


PAUL HORST 


University of Washington 


INTRODUCTION 


In a recent article®’ the writer proposed and illustrated a method of multiple 
classification which is based on a simple multiple regression model rather than the 
multiple discriminant function technique proposed by Rao“ and others. Both 
methods begin with a group of persons which has been broken down into mutually 
exclusive subgroups. A set of measures such as test scores is assumed to be available 
for each member of the total group. The general problem is to determine as many 
different linear combinations of the measures as there are subgroups. These linear 
combinations may be called classification scores. They are to be determined in such 
a way that when each person is classified according to his highest classification score 
the persons in the subgroups so formed correspond as closely as possible to those in 
the original corresponding subgroups. The presumption is that the sets of weights 
determined for making the linear combinations then may be used subsequently for 
obtaining classification scores on persons whose group classification has not yet been 
determined, and that these scores may be useful in assigning them to appropriate 
groups for special consideration, such as appropriate therapy, or what not. 

One condition which may be required reasonably of any multiple classification 
system is that when applied to the groups upon which it was developed it should 
classify the same number of persons in each subgroup as were originally classified in 
that group. Knutsen“? has given an example where both the methods of Rao and 
Horst yield subgroup numbers markedly disparate from those in the original classi- 
fications. It is not difficult to see how this might occur. If we take the extreme case 
for the “least square” method in which there is no relationship between the test
scores and the group classifications, then the “least square” solution would give the
mean of a group as the classification score of all persons for that group. In the “least
square” multiple classification method each subgroup is regarded as a criterion
variable in which a person’s score is “1” if he belongs to the group and “0” if he does
not. Therefore, the mean of each criterion variable is simply the percentage of the 
total group belonging in the subgroup. 

Let us consider then what would happen in the least square classification tech- 
nique if the multiple correlation between the independent variables and each of the 
dichotomous criterion variables were zero. This would mean that for each person, 
his classification score for each subgroup would simply be the percentage of the total 
group in that subgroup. Therefore, if each person were then classified on the basis 
of his highest classification score, all persons would be classified in the subgroup 
which in the original classification was the largest. This is, of course, an extreme case. 
But in general the less discriminating the independent variables, the greater will be 
the tendency of the least square classification technique, as such, to throw all cases 
into the subgroup which was originally the largest. 
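
The extreme case just described can be sketched in a few lines. This is only an illustration: the subgroup sizes are those of the Rao and Slater example used later in this article, and the function name `zero_correlation_scores` is hypothetical.

```python
# Subgroup sizes from the Rao and Slater example (Table 1 of this article).
sizes = {
    "anxiety state": 114,
    "hysteria": 33,
    "psychopathy": 32,
    "obsession": 17,
    "personality change": 5,
    "normal": 55,
}
total = sum(sizes.values())


def zero_correlation_scores(group_sizes):
    """Classification scores when the multiple correlations are all zero.

    Every person then receives the same score for a subgroup:
    the proportion of the total group in that subgroup.
    """
    n = sum(group_sizes.values())
    return {group: count / n for group, count in group_sizes.items()}


scores = zero_correlation_scores(sizes)

# Each person is assigned to the subgroup with the highest score,
# and every person has identical scores, so all land in one subgroup.
assigned = max(scores, key=scores.get)
predicted_counts = {g: (total if g == assigned else 0) for g in sizes}
print(assigned, predicted_counts[assigned])
```

All 256 cases fall into the originally largest subgroup, which is the disparity the modified method is meant to remove.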


METHOD 


To overcome this difficulty a comparatively simple modification of the method 
previously presented suggests itself. In the original method ®? we have required that 
the regression weights plus the additive constant when applied to the independent 
variables give the best fit to a given criterion all of whose values are either “0” or


¹This research was carried out under Research Grant M-743 MH (1) from the Public Health
Service, National Institutes of Health. Credit is due Charlotte MacEwan for major editorial and 
computational assistance, Jo Conaway for special competence in typing the manuscript. Supervision 
of computational and editorial activities was provided by William V. Clemans. 
“1.” This procedure automatically yields a mean classification score for a given sub-
group $i$ equal to the proportion $p_i$ of the total group in that subgroup. The variance
of the classification scores by the method of least squares is $R_i^2 p_i q_i$, where $R_i$ is the
multiple correlation of the independent variables with the dichotomous criterion for
group $i$ and $q_i = 1 - p_i$. We may also require, however, that the variance of the
classification scores for category $i$ be equal to that of the criterion, $p_i q_i$. To show how
we may adapt the “least square” method to satisfy this condition we shall use the same
example as that given
in reference ®?. 


1. We begin with a table of raw test score means on tests A, B, and C for 
each of six different groups. These are given in Table 1. 


TABLE 1. MEAN SCORES OF NEUROTIC GROUPS (RAO AND SLATER, 1949)

                                        Group

Factors       Anxiety  Hysteria  Psycho-  Obsession  Personality  Normal   Total Number;
              State              pathy               Change                Test Means,
                                                                           Total Sample

Sample Size     114       33       32        17           5         55        256

Mean Score
   A          2.9298    3.0303   3.8125    4.7059      1.4000     0.6000    2.6406
   B          1.1667    1.2424   1.8438    1.5882      0.2000     0.1455    1.0508
   C          0.7281    0.5455   0.8125    1.1176      0.0000     0.2182    0.6172


2. Next we subtract the total group mean for a given test from each of the 
group means. This operation gives the matrix of deviation score group means 
shown in Table 2. The first entry here, for example, is 2.9298 — 2.6406 = .2892. 


TABLE 2. DEVIATION SCORE GROUP MEANS

                                        Group

Test    Anxiety   Hysteria   Psychopathy   Obsession   Personality   Normal
        State                                          Change

 A       .2892      .3897       1.1719       2.0653      -1.2406     -2.0406
 B       .1159      .1916        .7930        .5374       -.8508      -.9053
 C       .1109     -.0717        .1953        .5004       -.6172      -.3990


3. The matrix of test score variances and covariances for the total group 
is computed. This is given in Table 3. 


TABLE 3. TEST SCORE VARIANCES AND COVARIANCES








.7609 
. . 1640 1.9336 
. 7609 . 16+ .6503 1.5752 


4. The next step is to premultiply the matrix in Table 2 by the inverse of
the matrix in Table 3. Methods for performing such multiplications without 
first finding the inverse are well known". The results of these computations 
are given in Table 4. 
TABLE 4. THE MATRIX $G_T^{-1} m$

                                        Group

Test    Anxiety   Hysteria   Psychopathy   Obsession   Personality   Normal
        State                                          Change

 A       .0395      .1360        .1535        .5127                   -.4130
 B       .0715      .1369        .7506        .0702                   -.6131
 C       .1062     -.3035       -.0686


5. Next, corresponding entries of column 1 of Table 2 and column 1 of 
Table 4 are multiplied and summed to get the first entry in the first row of Table 
5. For example, this entry is 

.2892 x .0395 + .1159 x .0715 + .1109 x .1062 = .031488. 


Using corresponding columns of Table 2 and Table 4 the remaining entries in 
row 1 of Table 5 are calculated. 
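
The product-sum of step 5 can be checked with a short sketch. The numbers are the anxiety state columns of Tables 2 and 4 as quoted in the worked example above; the helper name `product_sum` is illustrative only.

```python
# Column 1 (anxiety state) of Table 2: deviation score group means.
m_anxiety = [0.2892, 0.1159, 0.1109]
# Column 1 (anxiety state) of Table 4.
h_anxiety = [0.0395, 0.0715, 0.1062]


def product_sum(a, b):
    """Multiply corresponding entries of two columns and sum the products."""
    return sum(x * y for x, y in zip(a, b))


row1_anxiety = product_sum(m_anxiety, h_anxiety)
print(round(row1_anxiety, 6))  # 0.031488, the first entry in row 1 of Table 5
```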
TABLE 5. CALCULATIONS FOR ADJUSTING LINEAR DISCRIMINANT WEIGHTS

                                        Group

Computa-
tional      Anxiety    Hysteria   Psychopathy   Obsession   Personality   Normal
Steps 5-9   State                                           Change

Row 1        .031488    .101012     .761715      1.172666    1.154650     1.388112
Row 2        .445312    .128906     .125000       .066406     .019531      .214844
Row 3        .247009    .112289     .109375       .061996     .019150      .168686
Row 4       7.844544   1.111640     .143590       .052868     .016585      .121522
Row 5       2.8008     1.0544       .3789         .2299       .1288        .3486




6. Each $p_i$ is calculated and entered in row 2 of Table 5. For example,
the first entry is 114 ÷ 256 = .445312.

7. Each $p_i$ is multiplied by $1 - p_i$ and entered in row 3 of Table 5. For
example, the first entry is .445312 x .554688 = .247009.

8. Each entry in row 3 of Table 5 is divided by the corresponding entry
in row 1 to give the corresponding entry in row 4. For example, we have, for
the first entry, .247009 ÷ .031488 = 7.844544.

9. Each entry in row 5 of Table 5 is the square root of the corresponding
entry in row 4. For example, the first entry is √7.844544 = 2.8008.

10. Next, each entry in column 1 of Table 4 is multiplied by the first entry 
in row 5 of Table 5, and the products are entered as the first three entries in the 
first column of Table 6. For our data the entries in the first column of Table 6 
are, respectively: 

.0395 x 2.8008 = .1106; .0715 x 2.8008 = .2003; .1062 x 2.8008 = .2974. 
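
Steps 6 through 10 for the anxiety state group can be retraced as follows. This is a sketch using the figures quoted in the text; because the product-sum is entered here at its six-place rounded value, the intermediate results agree with Table 5 only to about four decimals.

```python
import math

n_group, n_total = 114, 256      # anxiety state sample size and total sample size
m_h = 0.031488                   # row 1 of Table 5 (step 5)
h = [0.0395, 0.0715, 0.1062]     # column 1 of Table 4

p = n_group / n_total            # step 6: proportion in the subgroup
pq = p * (1 - p)                 # step 7: variance of the 0/1 criterion
ratio = pq / m_h                 # step 8: row 3 divided by row 1
factor = math.sqrt(ratio)        # step 9: square root of row 4
weights = [factor * v for v in h]  # step 10: adjusted discriminant weights

print(round(factor, 4), [round(w, 4) for w in weights])
```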


TABLE 6. LINEAR DISCRIMINANT WEIGHTS, B, AND CONSTANT TERMS

                                        Group

Tests   Anxiety   Hysteria   Psychopathy   Obsession   Personality   Normal
        State                                          Change

 A       .1106      .1434        .0582        .1179        .0024      -.1440
 B       .2003      .1443        .2844        .0161       -.1072
 C       .2974     -.3203       -.0260        .0349       -.0980

 K      -.2408     -.2037       -.3115       -.2834        .1863
The remaining entries in the first three rows of Table 6 are obtained by using the
successive columns from Table 4 and the corresponding elements from row 5 of
Table 5. The columns in the first three rows of Table 6 give linear discriminant
weights such that, when applied to the raw scores, the variances of the resulting
classification scores are equal, respectively, to the pq values for the subgroups.
We still have to determine the constant terms, K, so that the means of the classi-
fication scores are equal, respectively, to the p values for these subgroups. These
we compute in step 11.


11. From the p value for a given group we subtract the product sum of 
total group means by the corresponding weights from Table 6. For example, 
the constant term for the anxiety state is 


.4453 - (2.6406 x .1106 + 1.0508 x .2003 + .6172 x .2974) = -.2408.
These values are entered in row K of Table 6. 
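
Step 11 for the anxiety state group can be checked with the sketch below. It uses the total-sample test means from Table 1 (2.6406, 1.0508, and .6172 for tests A, B, and C) and the first column of Table 6; the variable names are illustrative.

```python
p = 0.4453                                # proportion in the anxiety state group
total_means = [2.6406, 1.0508, 0.6172]    # total-sample means of tests A, B, C
weights = [0.1106, 0.2003, 0.2974]        # column 1 of Table 6

# Constant term: p minus the product-sum of total means and weights.
k = p - sum(mu * w for mu, w in zip(total_means, weights))
print(round(k, 4))  # -0.2408
```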


We can readily prove that the linear discriminant weights together with 
the constant terms in Table 6 do yield sets of classification scores with means 
and variances equal to means and variances of the corresponding dichotomous 
criteria. This we show in the appendix. 


APPENDIX 
THE MEANS AND VARIANCES OF THE CLASSIFICATION SCORES
We define our notation as follows:

$X$ is an $N \times n$ matrix of raw score measures where $N$ is the number of cases and $n$
the number of variables.

$x$ is the same matrix for deviation scores.

$L$ is an $N \times s$ classification matrix with a “1” in each row and all remaining ele-
ments “0.” Each row of $L$ corresponds to a person and each column to a classi-
fication category. A person has a score of “1” for the category in which he has
been classified and “0” for all others. There are $s$ categories.

$p_i$ is the proportion of cases in group $i$.

$D_p$ is the diagonal matrix of the $p_i$'s.

$M$ is an $n \times s$ matrix of raw score means for each group on each test.

$m$ is an $n \times s$ matrix of deviation score means for each group on each test.

$b$ is an $n \times s$ matrix of least square regression weights.

$B$ is an $n \times s$ matrix of $s$ vectors of desired discriminant weights.

$K$ is a vector of order $s$ of constant terms to be used with the discriminant vectors.

First we shall consider the solution for the $b$ vectors which will yield the best
“least square” fit of classification scores to the corresponding classification vectors
in $L$. We have by definition

$$x = \left(I - \frac{1\,1'}{N}\right) X , \tag{1}$$

where $1$ is a unit vector of order $N$. We let

$$l = \left(I - \frac{1\,1'}{N}\right) L . \tag{2}$$

Now (1) and (2) give us the predictor and criterion matrices in deviation form. Hence,
as is well known, the solution for the matrix of regression vectors is

$$b = \left(\frac{x'x}{N}\right)^{-1} \frac{x'l}{N} . \tag{3}$$

But from (1), (2), and (3) it is readily proved that

$$b = \left(\frac{x'x}{N}\right)^{-1} \frac{x'L}{N} . \tag{4}$$

It can also be shown that

$$\frac{x'L}{N} = m D_p . \tag{5}$$

If we let $G_T$ be the covariance matrix of the total group, we have by definition

$$G_T = \frac{x'x}{N} . \tag{6}$$

From (4), (5), and (6),

$$b = G_T^{-1} m D_p . \tag{7}$$

Now Table 2 is simply $m$ and Table 3 is $G_T$. By the method of computation,
therefore, Table 4 is the matrix of vectors proportional to the “least square” re-
gression vectors $b$ where the proportionality factors are given in $D_p$. It remains to
show, however, that Table 6 yields the best “least square” discriminant weights
with the specified restrictions on the variances of the classification scores.


We next consider the development of the general linear “least square” solution
for a set of predictors and a criterion with a restriction on the estimated criterion
variance. We let $x$ be the deviation predictor matrix, $y$ the deviation criterion vector,
$\sigma_y^2$ the variance of the criterion, and $B$ the restricted regression vector. We write

$$xB - y = \epsilon . \tag{8}$$

By definition the variance of the estimated criterion is

$$\sigma_{\hat{y}}^2 = \frac{B'x'xB}{N} . \tag{9}$$

However, according to the restriction, we also have

$$\sigma_{\hat{y}}^2 = \sigma_y^2 . \tag{10}$$

We wish to minimize $\epsilon'\epsilon$ with the restriction in equation 10. We write

$$\phi = \epsilon'\epsilon - \lambda \sigma_{\hat{y}}^2 , \tag{11}$$

where $\lambda$ is the Lagrangian multiplier.
Substituting (8) and (9) in (11), we have

$$\phi = B'x'xB - B'x'y - y'xB + y'y - \frac{\lambda}{N} B'x'xB . \tag{12}$$

Differentiating (12) with respect to $B'$ and equating the result to zero we get

$$x'xB - x'y - \frac{\lambda}{N} x'xB = 0 . \tag{13}$$

From (13)

$$B = (x'x)^{-1} x'y \left(\frac{1}{1 - \lambda/N}\right) , \tag{14}$$

or letting

$$\frac{1}{1 - \lambda/N} = C , \tag{15}$$

we have

$$B = (x'x)^{-1} x'y \, C . \tag{16}$$

Equation 16 gives the solution in the familiar form for the conventional “least
square” regression vector except for the scalar $C$. This shows that, whatever restric-
tion we place on the variance of the estimated criterion, the restricted “least square”
regression vector will be proportional to the conventional one. We can very simply
determine $C$. Substituting the right side of (16) into the left side of (10), we get

$$C^2 \, \frac{y'x}{N} \left(\frac{x'x}{N}\right)^{-1} \frac{x'y}{N} = \sigma_y^2 . \tag{17}$$

From (16) and (17),

$$B = (x'x)^{-1} x'y \, \frac{\sigma_y}{\sqrt{\dfrac{y'x}{N} \left(\dfrac{x'x}{N}\right)^{-1} \dfrac{x'y}{N}}} . \tag{18}$$

In passing, it is of interest to point out the relationship between equation 18
and well known multiple regression constants. Suppose, for example, that the pre-
dictors and criteria are all in standard units. Then the matrix of predictor inter-
correlations is given by

$$\frac{x'x}{N} = r_{xx} , \tag{19}$$

and the vector of validity coefficients by

$$\frac{x'y}{N} = r_{xy} . \tag{20}$$

Using (18), (19), and (20), we get

$$B = r_{xx}^{-1} r_{xy} \, \frac{1}{\sqrt{r_{xy}' \, r_{xx}^{-1} \, r_{xy}}} . \tag{21}$$

But if we let $\beta$ be the vector of regression weights when all measures are in standard
units and $R_{y.x}$ be the multiple correlation of $y$ with the $x$ variables, then as is well
known,

$$\beta = r_{xx}^{-1} r_{xy} , \tag{22}$$

$$R_{y.x}^2 = r_{xy}' \, r_{xx}^{-1} \, r_{xy} . \tag{23}$$

Using (22) and (23) in (21), we have

$$B = \frac{\beta}{R_{y.x}} . \tag{24}$$


To see what happens in the special case considered in this article we go back to
equation 18. If we substitute for $y$ the $i$th column of $l$ and designate this as $l_i$ we
have

$$x'y = x'l_i . \tag{25}$$

But because of (4) and (5) we may rewrite (25) as

$$\frac{x'y}{N} = m_i p_i , \tag{26}$$

where $m_i$ is the $i$th column of $m$. Using (26) and (6) in (18),

$$B_i = G_T^{-1} m_i p_i \, \frac{\sigma_y}{\sqrt{p_i \, m_i' \, G_T^{-1} m_i \, p_i}} , \tag{27}$$

where $B_i$ is the $i$th column of $B$.
Suppose now we specify that the variance of the estimated criterion be the var-
iance of the corresponding dichotomous criterion, viz., $p_i q_i$, so that

$$\sigma_y^2 = p_i q_i . \tag{28}$$

If we let $b_i$ be the $i$th column vector of $b$, we have from (7), (27), and (28)

$$B_i = b_i \sqrt{\frac{p_i q_i}{p_i \, m_i' b_i}} . \tag{29}$$

Suppose we let

$$h_i = \frac{b_i}{p_i} . \tag{30}$$

Then from (29) and (30),

$$B_i = h_i \sqrt{\frac{p_i q_i}{m_i' h_i}} . \tag{31}$$

It is readily verified that $h_i$ is the $i$th column vector from Table 4.

Furthermore:
Step 5 gives the $m_i' h_i$ values.
Step 7 gives the $p_i q_i$ values.
Steps 8 and 9 give the values $\sqrt{p_i q_i / (m_i' h_i)}$.
Step 10 gives the $B_i$ vectors in Table 6.

To get the vector $K$ of constant terms we have simply to write

$$1'(XB + 1K') = 1'(1\,1'D_p) , \tag{32}$$

where the “1's” are conformable unit vectors, or

$$M_T' B + K' = 1'D_p , \tag{33}$$

where $M_T'$ is a row vector of total group means for the tests. From (33),

$$K' = 1'D_p - M_T' B . \tag{34}$$

It can be readily verified that step 11 consists of the computations indicated by (34).
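
The result of the appendix can be illustrated numerically for the simplest case of a single predictor. The sketch below uses invented data (eight hypothetical cases, one test score, one 0/1 membership indicator), not the data of this article. It rescales the conventional least-squares weight by the scalar $C$ and confirms that the resulting classification scores have mean $p$ and variance $pq$.

```python
import math

# Hypothetical data: one test score and a 0/1 indicator for one subgroup.
x = [1.0, 2.0, 2.0, 3.0, 4.0, 5.0, 5.0, 6.0]
y = [0, 0, 1, 0, 1, 0, 1, 1]
n = len(x)


def mean(values):
    return sum(values) / len(values)


def var(values):
    """Variance with divisor N, matching the appendix's use of x'x/N."""
    mu = mean(values)
    return sum((v - mu) ** 2 for v in values) / len(values)


p = mean(y)                  # proportion in the subgroup
q = 1 - p
mx = mean(x)
cov_xy = sum((a - mx) * (c - p) for a, c in zip(x, y)) / n

b = cov_xy / var(x)                       # conventional least-squares weight
C = math.sqrt(p * q / (b * b * var(x)))   # scalar fixing the score variance at pq
B = b * C                                 # restricted weight, proportional to b
K = p - mx * B                            # constant term: p minus mean times weight

scores = [K + B * a for a in x]
print(round(mean(scores), 6), round(var(scores), 6), round(p * q, 6))
```

With more than one predictor the same rescaling applies to each column of weights, with the matrix products of the appendix taking the place of the scalar variance and covariance.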
DOES TAT ADD INFORMATION TO INTERVIEWS?
STATISTICAL ANALYSIS OF THE INCREMENT¹


ROBERT F. WINCH AND DOUGLAS M. MORE 


Northwestern University McMurry, Hamstra and Co., Chicago 


PROBLEM 


The purpose of this paper is to determine whether indirect procedures of assess-
ing such motivational variables as dominance and achievement yield information
beyond that contained in direct procedures. Or, since our data-gathering and analy-
sis proceeded from the most direct to the least direct procedures, at what point might
we have stopped this process without significant loss of information? Since the more
indirect procedures appear also to be the “deeper” procedures, we may phrase this
question: “How deeply did we need to probe in order to understand the personality
features at issue?” “Information” here is defined in terms of quantitative ratings,
not in terms of verbal observations or interpretations. The nature of the analysis,
moreover, involves the condition of linearity of regression.


PROCEDURE 


The sample consists of both spouses of 25 married couples interviewed in 1950,
at which time at least one member of each couple was an undergraduate student in
selected schools of Northwestern University. The students were of the “majority”
in the sense that they were white, middle-class, and native-born.²

The major variables of this study consist of twelve needs and three general 
traits adapted from the formulations of Murray®?. Some of these variables have 
been dichotomized into “overt” and “covert” levels of expression. Some were dicho-
tomized situationally into expression “within” and “without” the marriage. Both of
these dichotomies have been applied to some variables. Table 1 lists the variables
used, gives their definitions, and shows which dichotomies have been applied. From
the table it will be seen that the dichotomies and double-dichotomies convert the 15
variables into 44 sub-variables. All Ss were given: (a) a semi-structured “need-
interview” designed to elicit evidence regarding the major variables; (b) a less struc-
tured case-history interview; and (c) an eight-card TAT.

Ratings on every S with respect to each of the 44 sub-variables
noted in Table 1 were derived from: (a) the need-interview by means of content-
analysis designated as NI₁ in the footnote to Table 1 and as $X_1$ in the column head-
ings; (b) the need-interview by means of a projective-holistic analysis (4, pp. 447-448)
designated as NI₂ in the footnote of Table 1 and as $X_2$ in the column headings; (c)
the case history by means of a projective-holistic analysis designated as CH in the
footnote and as $X_3$ in the headings; and (d) the TAT by means of the usual pro-
jective-holistic analysis shown as $X_4$ in the column headings.

Using all of the above sources of information, a five-person conference rated 
every S on every variable. Because this set of ratings was based on all information 
and was made by a larger number of analysts (five) than any other set, we regard it 
as our “best” or criterion ratings.³ This set of ratings is referred to in the footnote to
Table 1 as FC and appears in the columns as the dependent variable Y. 

The two sets of ratings from the need-interview plus one set each from the case
history and from the TAT constitute the four independent variables of our 44 re-
gression equations. They also represent a hierarchy or rank-order in terms of direct-
ness of data-gathering and analysis. The need-interview consisted of 45 questions
designed to elicit evidence on needs and traits. Therefore, it is a more direct method
of gathering data than is the case history, wherein the S talks about his childhood and
subsequent relationships, and than the TAT, wherein he produces fantasies. More-
over, a content-analysis is more direct than a holistic-projective analysis. Our rank-
ing of these four independent variables, then, from most direct to least direct is:
NI₁, NI₂, CH, and TAT.


¹This investigation was supported by a research grant (MH-439) from the National Institute of
Mental Health, United States Public Health Service.

²More complete information about the sample appears in (3).

³In accordance with this concept the final conference ratings (FC) have been used as the dependent
variable in estimating the validities of the other procedures. (4, pp. 450-452 and Table 4.)

The question initially posed in this paper may be put into statistical form by 
asking whether or not a second (or less direct) predicting variable significantly im-
proves the prediction of the criterion scores made by a single (or more direct) pre-
dicting variable. Operationally this is equivalent to asking whether or not $R_{y.12}$ is
significantly greater than $r_{y1}$. Or in the context of the analysis of variance, this
becomes: is the mean square associated with the multiple regression significantly 
greater than the mean square for the residual variance? By extension of this pro- 
cedure it is possible to take the rank order of the directness of the data-gathering pro- 
cedures and modes of analysis noted above and then by means of multivariate an- 
alysis to determine at what point the use of further techniques fails to produce further 
information. 


RESULTS 


Computational and statistical details of the procedure are as follows. We have
50 subjects and 44 sub-variables. Including the final conference ratings (our cri-
terion procedure) we have five procedures in terms of which each S has a rating on
every sub-variable. Ratings are normalized within sex and within sub-variable on
each procedure. Within sub-variables all of the inter-procedure correlations (zero-
order) are run.

Then we begin (within each sub-variable) to build up a multivariate system
to an ultimate total of five variables (or a system with one dependent and four pre-
dicting variables). To illustrate let us examine the first row of Table 1. The zero-
order correlation between the criterion (FC) and the most direct predictor (NI₁) is
.73. This means that the content analysis of the need-interview accounts for 53
percent of the variance ($r_{y1}^2 = .53$) and in this sense accounts for 53 percent of the
information contained in the ratings made by the final conference. We then ask the
question: if we take account of the contribution of the next most direct procedure
(NI₂, or holistic analysis of the need-interview), shall we have improved our pre-
diction of the FC ratings significantly, and in this sense gained information? We see
that $R_{y.12} = .87$, or that the two predictors together (NI₁ and NI₂) account for 75
percent of the variance ($R_{y.12}^2 = .75$). The F test shows that the increment of
variance explained by NI₂ is significant at the .01 level.

We now continue to build up the system by adding a third predictor, the case
history interview, and $R_{y.123} = .90$. The explained variance has now jumped to 81
percent ($R_{y.123}^2 = .81$), and again we have had a significant increment of informa-
tion.

Let us jump to the last column in row 1. Here we see that $R_{y.1234} = .90$,
which is also the value of $R_{y.123}$. Since these values are the same, it is evident that
whatever the TAT ratings “told” us, they did not contribute to any statistical ex-
planation of the criterion ratings, and hence on this sub-variable the TAT did not
add an increment of information.

The second-last column shows what would have happened if we had regarded
the TAT as more direct than the case history. Or, statistically, it answers the ques-
tion: how much information does the TAT add beyond that derived from the NI₁
and NI₂, i.e., with the CH disregarded? Since in the first row $R_{y.124} = R_{y.12} = .87$,
the answer is that with respect to abasement overt-within the TAT adds nothing
to what we already knew (in terms of ratings) from the two analyses of the need-
interview.


⁴These are summarized in (4), Tables 3 and 4.
Every $R_{y.12}$ is significantly greater at the .01 level than the corresponding
$r_{y1}$. Thus for every sub-variable by trying to see the case “whole” we add materially
to the information derived from the direct and objective analysis of the need-inter-
view.

Thirty-five of the $R_{y.123}$'s are significantly greater than the corresponding
$R_{y.12}$'s at the .01 level; 7 are significantly greater at the .05 level; 2 are not sig-
nificantly greater at the .05 level. Since in more than three-quarters of the sub-
variables the holistic analysis of the case history adds a real increment of information
beyond the NI, it appears that the historical view of the S contained in this pro-
cedure contributes information beyond that contained in the more ahistoric need-
interview.

It appears that the few significant increments arising from the TAT ($R_{y.124}$
compared with $R_{y.12}$, and $R_{y.1234}$ compared with $R_{y.123}$) are no more numerous than
might be expected by chance alone. Thus the TAT appears to have added no statis-
tically discernible increment of information beyond that contained in the NI plus
the CH or in the NI alone. Therefore the information on these Ss was virtually com-
plete with the holistic analysis of the case history and before the taking of the TAT.


DISCUSSION

An immediate reaction to this finding might be that the TAT was useless in this 
study. It seems indisputable that the TAT was useless in accounting for higher 
order increments of variance under the assumption of linearity of regression. Pending 
further study, however, the authors are not ready to assert that the TAT was with-
out qualification “useless in this study.” Rather, it is our clinical impression that
sometimes the blind analysis of the TAT produced “bull’s-eye” reports, that some-
times the analyses were on the target but not bull’s-eyes and that sometimes they
were quite wild. Further, it is our impression that the “wild” errors frequently arose
from the fact that the analysis was performed “blind.” Under such circumstances,
the usual procedure for administering and analyzing the TAT does not seem to give a
high rate of assurance that the analyst will recognize the subject’s true identification-
figure in each story.⁵ Knowing something about the subject from the interviews can
enable the analyst to reduce his errors. For example, in one case a subject’s mother 
had died when he was about five, and he had been reared from age ten by a step- 
mother who treated him in a rather harsh, rejecting fashion. In the absence of this 
information the analyst was disposed to paint a portrait of the man’s feeling about 
his mother as characterized by extreme ambivalence—a vacillation between strong 
affectional ties and attitudes of hatred and fear. If the analyst had known that there 
had been in effect two strikingly different mother-figures in the subject’s background, 
this complex picture of the mother might have been unravelled from the projective 
information. 

It has been a not uncommon experience in our research that after such informa- 
tion is added to the TAT analysis in the final conference, the corrected interpretation 
seems to add considerable understanding beyond that possible from the interviews 
alone. But, it must be remembered that the TAT ratings used in the present statisti- 
cal analysis were produced without such background information. These remarks 
are not intended to nullify but only to qualify the remark that the TAT made no 
contribution to the statistical analysis of our data. 


SUMMARY 


The purpose of this study has been to see whether or not less direct methods of 
gathering and analyzing personality data add information beyond that derived from 
more direct methods. From most to least direct the methods are: content analysis
of a semi-structured interview (NI₁), holistic analysis of the same interview (NI₂),
content analysis of a case history interview and holistic analysis of a TAT. As the
quantified information from each source is added to the previous information, a test
is made to determine whether or not the increment is of statistically significant mag-
nitude. In general significant increments are added except for the TAT which makes
no statistically discernible contribution.


⁵Professor H. A. Murray has attempted to correct for this difficulty by incorporating into his
instructions to the S an injunction to begin each story by naming the central figure or hero. It is
his hope that this will lead the TAT analyst more consistently to the identification-figure.


TECHNICAL ADDENDUM 
The general form of the equation on which this analysis was based is:

$$F = \frac{n_2 \left[ (1 - R_{y.ij}^2) - (1 - R_{y.ijk}^2) \right]}{n_1 \, (1 - R_{y.ijk}^2)} ,$$

where $n_1 = 1$ = the degree of freedom associated with regression on the independent
variable whose increment of information is under scrutiny, and $n_2$ equals the number
of degrees of freedom associated with residuals from the higher order plane of re-
gression ($y.ijk$). Since $n_1 = 1$, this may be simplified to:

$$F = \frac{n_2 \left[ (1 - R_{y.ij}^2) - (1 - R_{y.ijk}^2) \right]}{1 - R_{y.ijk}^2} = \frac{n_2 \, (R_{y.ijk}^2 - R_{y.ij}^2)}{1 - R_{y.ijk}^2} .$$
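
As a rough numerical illustration of this formula, the sketch below applies it to the first-row figures quoted in the RESULTS section (a zero-order correlation of .73 and a two-predictor multiple correlation of .87, with 50 subjects). The function name `increment_f` is hypothetical, and the residual degrees of freedom are taken as N - 3, which is an assumption not stated in the text.

```python
def increment_f(r2_lower, r2_higher, n2, n1=1):
    """F ratio for the increment in explained variance.

    r2_lower  : squared correlation for the lower order plane
    r2_higher : squared multiple correlation for the higher order plane
    n2        : residual degrees of freedom for the higher order plane
    n1        : degrees of freedom for the added predictor (1 here)
    """
    return n2 * (r2_higher - r2_lower) / (n1 * (1.0 - r2_higher))


# Correlations quoted for row 1 of Table 1; n2 = N - 3 is assumed
# (50 cases, two predictors, one constant).
f_value = increment_f(0.73 ** 2, 0.87 ** 2, n2=50 - 3)
print(round(f_value, 2))
```

Under these assumptions the ratio comes to about 43, well above the .01 critical value for 1 and 47 degrees of freedom (approximately 7.2), which agrees with the significance reported in the text.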


We are indebted to Professor Leo Goodman of the University of Chicago for 
calling our attention to a criticism of this line of analysis“. In this critique Kemp- 
thorne raises two questions which are worthy of consideration. 

In the first place he suggests that the mean square for residuals, $(1 - R_{y.ijk}^2)/n_2$
in the last expression above, should be computed from the highest order plane of
regression rather than from the $R_{y.ijk}$ plane under scrutiny. It seems to us that there
are arguments for doing it either way and that the practical consequence of this issue
is very small, especially as $n$ becomes large. It appears that the effect of leaving in
the possible bias of which Kempthorne speaks (by using the mean square for residuals
from anything other than highest order regression) would be to increase the proba- 
bility of accepting the null hypothesis. All three of the differences noted in Table 1 
(cf. notes f and §) are in this direction. 

Kempthorne’s second point is that the b’s in the lower order plane of regression
(e.g., $b_{y1.23}$) will be biased if a b in a higher order regression (e.g., $b_{y4.123}$) is not
equal to zero. On page 533 he gives an expression for the bias when the three-
dimensional problem is stated erroneously as a two-dimensional problem. To us this
does not seem to be a bias in the usual sense unless we regard $b_{yi.j}$ as an estimate
not only of $\beta_{yi.j}$ but also of $\beta_{yi.jk}$. Instead of the procedure used here, Kempthorne
suggests an iterative procedure which begins with tests of significance for all of the
highest order b’s. When a b is found to be non-significant, the variate corresponding
to the second subscript (i.e., the independent variable which is allowed to vary) is
eliminated from the analysis. Then the b’s are recomputed on the next lower order,
and the system continues to be shaken down until there is some order at which all
b’s are significantly different from zero.
CONVERSION HYSTERICS AND THE MMPI

PROBLEM


Despite the wide use of the Minnesota Multiphasic Personality Inventory 
(MMPI) with clinical populations there have been few studies concerned with the 
MMPI scores of a single, homogeneous, diagnostic group. Generally interscale cor- 
relation coefficients and/or means and standard deviations for a heterogeneous 
sample of ‘‘abnormals” or neuropsychiatric patients are given “® ' 5-16. The study 
by Wauck “* on schizophrenics is a possible exception although his sample included 
several ‘‘types”’ of schizophrenia. For comparison purposes Wauck’s report is some- 
what unsatisfactory; for example he (like many others) did not indicate whether or 
not K-corrected MMPI scores were used. Wauck presented interscale correlations 
only with the Sc scale. A further criticism is that while his data show the highest
mean T-score (69.40) for his schizophrenics was obtained on Sc, he stated that an
“item analysis” disclosed all the items in the Sc scale were “non-discriminatory”!
The purpose of this paper is to present the MMPI scores of a homogeneous sample
of clinically diagnosed conversion hysterics and particularly to point out the effect
on interscale correlation coefficients (r’s) when “corrected” clinical scales are
used.


SAMPLE 


The sample consists of 63 females who were diagnosed “‘psychoneurosis, con- 
version hysteria” and tested with the MMPI since 1943 in the University Hospital, 
Minneapolis, Minnesota. The data were collected from the files of Dr. S. R. Hathaway.¹ 
All available cases were used; the questionable practice of discarding cases 
having high L, F, or K scores was not followed. Not one of the 63 cases had been 
involved in the original validation of the Hy scale. This sample, as was Wauck’s, is 
a cross-validation sample. Most of the patients on entering the hospital had taken 
the individual card form; a minority had taken the group form. 


PROCEDURE 


Scores on each case were obtained for the 36 variables listed in Table 1. It is 
to be noted that several “‘corrected’’ Hy scores have been obtained. In two recent 
articles“: 7) the writer has shown that some of the MMPI scales appear to be over- 
corrected and some appear to be under-corrected for K. Specifically it was shown 
that the validity of Hy (in which 78 per cent of the items are scored false) could be 
improved by subtracting a certain fraction of K (in which 97 per cent of the items are 
scored false) or by adding a certain fraction of a response set scale in which the items 
are scored true. Accordingly, six-tenths and eight-tenths of K, and B, were arbi- 
trarily subtracted from, and added to Hy. A limited discussion of the optimum 
amount of suppressor correction has been presented elsewhere“ 1°. A correction 
which is optimum for separating a particular clinical group from normals probably 
will not be optimum for separating two clinical groups from each other, or for in- 
creasing the independence of the clinical scales. The fractions selected here serve to 
illustrate some of the effects of using corrected scores. 
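
The arithmetic behind the corrected Hy variables can be sketched as follows. This is an illustration only: the function name `corrected_hy` and the raw scores are hypothetical, and only the fractions (.6 and .8) and the direction of each correction are taken from the text.

```python
def corrected_hy(hy_raw, suppressor_raw, fraction, add):
    """Return raw Hy with a fraction of a suppressor score added or subtracted.

    add=False gives the K-type correction (Hy - fraction * K);
    add=True gives the B-type correction (Hy + fraction * B).
    """
    sign = 1.0 if add else -1.0
    return hy_raw + sign * fraction * suppressor_raw


# Hypothetical raw scores for a single test-taker (not taken from the data).
hy, k, b = 29, 17, 26

variants = {
    "Hy-.6K": corrected_hy(hy, k, 0.6, add=False),
    "Hy-.8K": corrected_hy(hy, k, 0.8, add=False),
    "Hy+.6B": corrected_hy(hy, b, 0.6, add=True),
    "Hy+.8B": corrected_hy(hy, b, 0.8, add=True),
}

for name, score in variants.items():
    print(f"{name}: {score:.1f}")
```

The four keys correspond to variables 33 to 36 of Table 1; the fractions themselves remain arbitrary, as stated above.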

The B scale was modeled after the Set T scale of the Opinion, Attitude and 
Interest Survey“ * ©. Scores from the 63 item response bias scale B“ were obtained 
by counting the number of times the test-taker answered true to those items which 
were answered true by 40 to 60 per cent of Hathaway and McKinley’s original 
“normal” sample which has been used in the construction of the other MMPI scales. 


¹The writer is grateful to Dr. S. R. Hathaway for his cooperation in this project. 
High scores indicate a strong set on the part of the test-taker to answer true to the 
verbal stimuli; low scores indicate the opposite. Means, standard deviations (SD’s), 
and interscale r’s were computed. 


RESULTS AND DISCUSSION 


Perhaps the most important fact disclosed by Table 1 is that the highest mean 
T-score for the 63 conversion hysterics is 67.54, obtained on Hy. Substantial status 
or concurrent validity for the MMPI is indicated by this finding. The second highest 
T-score of 65.60 on K-corrected Hs is undesirably high. Perhaps this high mean is to be expected 
in view of the generally high r’s reported between Hy and Hs for clinical populations 
(10, 16, 16). For the 63 hysterics the r between Hy and K-corrected Hs is .82, and 
between Hy and uncorrected Hs, .70. While the K correction has decreased the in- 
dependence of the clinical scales its utility has been adequately demonstrated. The 
median T-score for the nine clinical scales is about 60.0. 

TABLE 1. MEANS AND STANDARD DEVIATIONS FOR 63 CONVERSION HYSTERICS 

      MMPI                 Statistic            MMPI                 Statistic 
      Variable          Mean       SD           Variable          Mean       SD 

  1   ?-T              50.68     2.66      19   Pa-T             55.70     9.56 
  2   ?                14.98    16.17      20   Pa                9.95     3.49 
  3   L-T              58.21     7.63      21   Pt+1.0K-T        59.35     9.98 
  4   L                 6.41     2.48      22   Pt+1.0K          30.79     6.16 
  5   F-T              54.76     7.67      23   Pt               13.48     9.74 
  6   F                 4.60     4.48      24   Sc+1.0K-T        61.70    10.53 
  7   Hs+.5K-T         65.60    12.90      25   Sc+1.0K          30.21     6.90 
  8   Hs+.5K           20.97     6.55      26   Sc               13.21     9.40 
  9   Hs               11.75     7.06      27   Ma+.2K-T         54.63    10.83 
 10   D-T              59.19    11.80      28   Ma+.2K           18.73     4.50 
 11   D                25.00     7.74      29   Ma               14.90     5.17 
 12   Hy-T             67.54    11.97      30   K-T              59.2[?]  41.52 
 13   Hy               28.71     6.85      31   K                17.[?]    6.26 
 14   Pd+.4K-T         60.05    10.45      32   B                26.[?]    7.56 
 15   Pd+.4K           23.19     4.63      33   Hy+.8B           49.[?]    8.97 
 16   Pd               16.25     4.64      34   Hy+.6B           44.[?]    8.06 
 17   Mf-T             57.35    10.08      35   Hy-.8K           14.89     8.29 
 18   Mf               32.89     5.00      36   Hy-.6K           18.32     7.70 






Next to Wauck’s high mean of 69.40 on the Sc scale was the 64.87 on the D 
scale. Wauck reported an r of only .32 between Sc and D. As mentioned previously, 
Wauck did not indicate whether K-corrected Sc was used; the writer assumes, here 
and elsewhere“ * that uncorrected scores have been used unless it is specifically 
stated that K-corrected scores have been used. (For the 63 hysterics the r between 
uncorrected Sc and D is .381; between K-corrected Sc and D, .26). The median T- 
score for Wauck’s schizophrenics was about 63.0. These comparisons might be inter- 
preted as indicating that the differential diagnostic value of the MMPI for con- 
version hysterics appears to be somewhat less than for schizophrenics. 

It is also of some interest to note that even though the scores of a homogeneous 
sample of hysterics are reported, most of the SD’s (including Hy) in Table 1 are 
larger than those of the normal population. It can also be seen from Table 1 that the 
SD’s of the K-corrected raw scores are generally considerably smaller than the SD’s 
of the uncorrected scores. K-corrected Hy is an exception; perhaps this is because 
it is corrected by the subtraction rather than the addition of a fraction of K. 

It is relevant here to point out that the fraction of K does not directly determine 
or reveal the extent of correction for each K-corrected scale. It is not uncommon to 
hear and to read that defensiveness-plus getting is corrected most in Pt and Sc, pre- 
sumably because 1.0K is added, and least in Pd and Ma because .4K and .2K are 
added. Actually the clinical scales and K are weighted in proportion to their SD’s. 
Since the SD of uncorrected Pd is considerably less than the SD of uncorrected Hs 
it follows that the K-correction is greater for Pd than for Hs even though .4K is 
added to Pd and .5K is added to Hs. That is, .4 x 6.26 = 2.50 is the SD for K when 
combined with Pd whose SD is 4.63; the variability of K is 54 per cent as great as 
the variability of Pd. Also .5 x 6.26 = 3.13 is the SD for K when combined with Hs 
whose SD is 7.06; the percentage here is 44. The percentages for Pt, Sc, and Ma are 
64, 67, and 25 respectively. 
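
The percentages above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the raw-score SD's as they are transcribed in Table 1 (K = 6.26; uncorrected Pd is taken as 4.64, which gives the same rounded figure as the 4.63 quoted above); the names `SD_K`, `SCALES`, and `k_weight_percent` are illustrative only.

```python
# SD of raw K and, for each scale, (fraction of K added, SD of the
# uncorrected raw scale), as transcribed from Table 1.
SD_K = 6.26
SCALES = {
    "Hs": (0.5, 7.06),
    "Pd": (0.4, 4.64),
    "Pt": (1.0, 9.74),
    "Sc": (1.0, 9.40),
    "Ma": (0.2, 5.17),
}


def k_weight_percent(fraction, sd_scale, sd_k=SD_K):
    """Variability of the added K component as a percentage of the scale's SD."""
    return 100.0 * fraction * sd_k / sd_scale


percents = {name: k_weight_percent(f, sd) for name, (f, sd) in SCALES.items()}

# List the scales from the most heavily to the least heavily K-weighted.
for name, pct in sorted(percents.items(), key=lambda item: -item[1]):
    print(f"{name}: {pct:.1f} per cent")
```

On these inputs Pd ranks above Hs even though a smaller fraction of K is added to it, which is the point made in the text. The figure for Ma computed from the transcribed SD falls slightly under the 25 quoted above.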

Table 2 shows the interscale r’s for the raw scores. One of the most interesting 
r’s is that of —.73 between K and B. The reliability of the 30 item K scale is rarely 
higher than .73. This finding supports the writer’s contention that K is essentially 
a measure of the response set to select a particular response category: 7). In this 
connection it should also be noted that the r’s with K-corrected Hy, and B-corrected 
Hy are essentially the same. 

The relation of corrected Hy to K-corrected Hs is of some practical importance; 
the r between uncorrected Hy and K-corrected Hs is .82. When .8K is subtracted 
from Hy, and .8B is added to Hy, the r’s with K-corrected Hs drop to .66. When 
six-tenths of the suppressor is used the drop in r is less. Although not indicated in 
Table 2 when ten-tenths of the suppressor is used the r is .60. Since 20 of the 33 
items in Hs (uncorrected) overlap with Hy (uncorrected) it is improbable that high 
independence for these scales could be achieved through K or B corrections. The 
other changes in interscale r’s with corrected Hy are probably of less significance. 

Almost without exception the interscale r’s with Hy + .8B are more positive 
than the corresponding interscale r’s with Hy + .6B for those scales in which most 
of the items are scored true (e.g., F, Pt, Sc), and more negative (or less positive), 
for those scales in which most of the items are scored false (e.g., L, K, Hs, D, Hy). 
This is probably to be expected“ 7). A reversed situation is found for K-corrected 
Hy. 

The changes in interscale r’s brought about by the K-corrections for the other 
clinical scales are generally substantial. Certain trends in r’s can be seen. For ex- 
ample the r’s between L and the scales normally K-corrected are consistently more 
positive, or less negative, for the K-corrected scales than for the uncorrected scales. 
The r’s with the K-corrected scores are nearer zero in four of the five comparisons 
(i.e., the r between L and K-corrected Sc is .18; between L and uncorrected Sc the 
r is only -.05). 

Perhaps more important than the above is the finding that the r between two 
K-corrected scales is consistently smaller than the r between the two uncorrected 
scales (e.g., the r between corrected Hs and corrected Pt is .38, and the r between 
uncorrected Hs and uncorrected Pt is .55; the r between corrected Sc and corrected 
Ma is .27, and the r between uncorrected Sc and uncorrected Ma is .42). It is ob- 
vious that the dimensions measured by the K-corrected scales are more independent 
than those measured by the uncorrected scales. Those who are interested in factor 
analyzing the MMPI should consider using K-corrected scores rather than the un- 
corrected scores. The reduced r’s are obtained, this writer believes, because the in- 
fluence of response set has been minimized. It is relevant here to mention that usually 
the first factor extracted from the factor analysis“: '* 4 5. 1© of uncorrected scales 
is a “general-serious maladjustment” factor with heavy positive loadings on F, Pt, 
and Sc; these are the scales having most of the scored responses as true. Frequently 
K and L have a negative loading on this factor; almost all of the items on these two 
short scales are scored false. The second factor, “‘expressive-repressive-neurotic,” 
usually has positive loadings on K, L, Hs, D, Hy; most of the items on each of these 
scales are scored false. Frequently Ma has a negative loading on this factor; most of 
the Ma items are scored true. Recently Welsh “* constructed scales to measure these 
two major MMPI factors, the first scale he called A (anxiety), and the second R 
(repression). As might be expected there is a true-false imbalance in these scales; 
38 of the 39 items in A are scored true, and all 40 items in R are scored false. 

While the writer has been unable to locate a correlation matrix for clinical 
cases comparable to that in Table 2, a comparison of the r’s for K-corrected and un- 
corrected scores for these conversion hysterics with the r’s for K-corrected and un- 
corrected scores for engineering graduates presented by McKinley, Hathaway, and 
Meehl will reveal marked differences. For example the r’s between Hs (K-cor- 
rected and uncorrected) and D are higher for hysterics than for engineers; the r’s 
between Hs (K-corrected and uncorrected) and Mf are lower for hysterics than for 
engineers. The r’s between Pd (K-corrected and uncorrected) and Hy are much lower 
for hysterics than for engineers. The r’s between Sc (K-corrected and uncorrected) 
and Hy are about the same for hysterics but they are widely discrepant for engi- 
neers. There appears to be something in the nature of the groups which not only 
differentially affects the mean scores but also the interscale r’s. 

Cottle and Powell“ have shown similar differences in the interscale r’s for 
uncorrected scales between the scores of college students and the chance scores of 
dice and a table of random numbers. Cook and Wherry“, Wheeler, Little, and 
Lehner“, and Williams and Lawrence“® have presented additional correlation 
matrices for what appear to be uncorrected scales. It is obvious from an inspection 
of all available correlation matrices that the size and sign of the interscale r’s is con- 
ditioned to a considerable extent by the kind of scores used (corrected or uncor- 
rected) as well as the nature of the test-takers. 


SUMMARY 


The MMPI scores of 63 conversion hysterics tested in the University Hospital 
at Minneapolis were presented and discussed. The highest mean score was 67.54, 
obtained on Hy; considerable status or concurrent validity for the MMPI is indi- 
cated by this finding. The second highest mean was 65.60 on K-corrected Hs. This 
mean and the correlation of .82 between Hy and K-corrected Hs is undesirably high. 
The correlation was reduced through the use of K- and B-corrected Hy scores. 

Some of the effects of correlating corrected scores were pointed out. Additional 
evidence to support a response set interpretation of the K scale was presented. 
PROBLEM 


This paper represents an attempt to explore the relationships between psycho- 
pathology and acceptance of self, parents and people. The relationships between 
these three areas of acceptance play a crucial role in Arieti’s theory of the develop- 
ment of the schizophrenic predisposition”. It is difficult to deduce hypotheses 
directly from Arieti’s theory since he points out many courses that the attitudes 
toward self and parents may take. However, there is some indication that adjust- 
ment should vary directly with acceptance of, or positiveness of feeling toward, 
parents and people. Since attitudes toward people are considered, in part, to be 
generalizations of attitudes toward the parents, acceptance of other people should 
vary directly with acceptance of the parents. 

Self-acceptance has been studied extensively in the form of congruency between 
the self and ideal concepts. Although highly self-accepting people tend to score low 
on the clinical scales of the MMPI: © there is some evidence that they are more de- 
fensive than others. Some external criterion is necessary to evaluate the adjustment 
of these self-satisfied subjects. Are they really ‘‘super-adjusted” or are they just as 
maladjusted as the self-dissatisfied subjects? The defense mechanisms utilized by 
the Ss may be of importance in determining their self-esteem. Bills“) found that 
those who externalize blame tend to have high self-acceptance while those Ss who 
internalize blame tend to have low self-acceptance. The following hypotheses were 
formulated to test these questions: 

1. Schizophrenics should show less acceptance of parents and people than 
non-schizophrenic patients. Non-schizophrenic patients should show less acceptance 
of parents and people than normals. 

2. Within the patient group there should be a significant relationship between 
acceptance of parents and people and degree of psychopathology. 

3. Assuming that paranoids and depressives represent the extremes of the 
externalization-internalization mechanism for dealing with hostility, paranoids 
should be more accepting of self and less accepting of other people than depressives. 

4. Acceptance of people should vary directly with acceptance of parents in 
both normal and patient groups. 


METHOD 


In this study we used the ideal concept as a convenient base-line for “acceptance.” 
The smaller the discrepancy between the self-concept and the ideal-concept, the 
more we say the individual accepts himself. This is a conventional definition of self- 
acceptance, or self satisfaction. We also used the ideal concept as the baseline for 
acceptance of parents and people. The more the concept of mother, for instance, 
approaches the individual’s concept of his ideal, the more we say he accepts her. 
Actually this difference between the ideal and a parent or people could also be called 
a measure of ‘identification aspiration’. The person is saying that he would like to 
be like his parent regardless of what he is actually like. We assume this also carries 
the connotation of acceptance. It must be remembered that when we speak of 
“acceptance” we are referring to the conscious or admitted acceptance of our opera- 
tional definition. A patient might over-idealize his parent as a defense against ad- 
mitting his unconscious rejection of the parent. In this study we are only concerned 
with the surface phenomena. 

The scale used to measure conceptions of self, ideal, parents and people was 
adapted from a scale developed by Buss at this hospital. It consists of 16 sub-scales 
covering clinically relevant personality dimensions. In each sub-scale a list of 
scaled adjectives describes various intensities or extremes of the dimension. There 
are eight words defining each sub-scale. 

Each S made five sets of ratings on this scale: self (S), ideal (I), mother (Mo), 
father (Fa), and people in general (P). For each S, four discrepancy scores were 
computed by subtracting the numerical values of the ratings on the ideal concept 
from the numerical values of the ratings on each of the other concepts. The signs of 
the differences between two concepts were ignored and the summed differences were 
the discrepancy scores. The larger the difference between ideal and any of the other 
concepts, the more dissimilar are the conceptions, and the less accepting we say the 
individual is. Acceptance, as defined, varies inversely with the size of the dis- 
crepancy score. 
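
The scoring rule just described can be sketched in a few lines. This is a minimal illustration, assuming each concept is rated once on each of the 16 sub-scales with a numerical value from 1 to 8; the ratings and the names `discrepancy_score`, `ideal`, and `concepts` are hypothetical and are not taken from the data.

```python
def discrepancy_score(ideal, other):
    """Sum of absolute rating differences between the ideal and another concept."""
    if len(ideal) != len(other):
        raise ValueError("both rating lists must cover the same sub-scales")
    return sum(abs(o - i) for i, o in zip(ideal, other))


# Hypothetical ratings for one subject on 16 sub-scales (values 1 to 8).
ideal = [4, 5, 4, 3, 4, 5, 4, 4, 3, 4, 5, 4, 4, 3, 4, 5]
concepts = {
    "S": [5, 5, 3, 3, 6, 5, 4, 2, 3, 4, 7, 4, 4, 3, 5, 5],
    "Mo": list(ideal),                # identical to the ideal
    "Fa": [r + 1 for r in ideal],     # one step away on every sub-scale
    "P": [8] * 16,                    # extreme rating on every sub-scale
}

# One discrepancy score per concept; larger means less "acceptance".
scores = {name: discrepancy_score(ideal, r) for name, r in concepts.items()}
for name, score in scores.items():
    print(f"I vs. {name}: {score}")
```

Because the signs of the differences are ignored, a concept rated consistently above the ideal and one rated consistently below it by the same amount receive the same score.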


Subjects. All Ss were between the ages of 17 and 45. All had lived with their parents 
or step-parents for at least the first 12 years of their lives. All potential Ss were given 
the Shipley Hartford (S-H) vocabulary scale. In order to eliminate those Ss without 
sufficient verbal ability to use the adjective scales, only those Ss scoring at an average 
(raw score 25) or higher level were used. 

There were 60 normal Ss, 30 males and 30 females. These Ss consisted mainly 
of attendants, nurses, clerical personnel, medical students and a few members of 
church social groups. The mean age in this group was 27; the mean S-H score was 
32, equivalent to an IQ of 118. Education was not recorded. 

The patient group consisted of 30 non-schizophrenic patients (NScP) and 30 
schizophrenic patients (ScP). Each of these groups of 30 consisted of 15 males and 
15 females. The non-schizophrenic group consisted mainly of psychoneurotics, 
personality trait disorders, and sociopathic personalities. The mean age in this group 
was 30; the mean S-H score was 30.5, equivalent to an IQ of 114; mean education 
was 12 years. 

Although all Ss were selected for above average intelligence (S-H) the patient 
and normal groups are not strictly equivalent on all variables. However the non- 
significant correlations between age, education and vocabulary score and the dis- 
crepancy scores, presented in table 1, make it unlikely that these three variables 
could account for the large differences found between normal and patient groups. 

TABLE 1. CORRELATION WITHIN PATIENT GROUP BETWEEN EDUCATION, AGE, S-H 
VOCABULARY SCORE AND THE DISCREPANCY SCORES AND CASE HISTORY RATINGS 

                       Discrepancy Scores 
Factors        S vs. I     I vs. P     I vs. Mo 

Education       -.069       -.167       -.132 
Age             -.025       -.122       -.127 
Vocabulary      -.174       -.101       -.018 





Case History Rating. Since our hypothesis postulates that the acceptance of parents 
and people will vary directly with adjustment, it was important to show a relation- 
ship within the patient group in addition to the gross diagnostic differences based on 
comparisons of normals and patients. Two authors independently rated the final 
case summaries along a 7 point scale of adjustment. Inter-rater reliability for the 60 
cases was .805, significant beyond the .01 level. The ratings of the two judges were 
averaged to get the Case History Rating (CHR). 


RESULTS 


Table 2 gives the correlations between the CHR for adjustment and the dis- 
crepancy scores. For the group of patients as a whole the relationships between 
acceptance of people (I vs. P) and acceptance of father (I vs. Fa) are significant. The 
worst adjusted patients show the least acceptance of people and father, as predicted. 
TABLE 2. CORRELATIONS IN PATIENT GROUP BETWEEN DISCREPANCY SCORES AND CASE 
HISTORY RATINGS 

                        Discrepancy Scores 
Factors      S vs. I     I vs. P     I vs. Fa     I vs. Mo 

Men            .143        .245        .085         .054 
Women         -.355*       .341        .324         .125 
Both          -.099        .276*       .252*        .082 

*Significant below .05 level, two tailed test. 


The relationship with acceptance of mother was not significant. The female patients 
show a negative relationship between self acceptance and adjustment. The worst 
adjusted females were the most self-accepting. It should be noted that partialling out 
the relationships found in table 1 would have the effect of raising the positive cor- 
relations in table 2 and lowering the negative ones. 

Group differences are presented in table 3. The first 3 rows give the means of 
three diagnostic groups in the analysis of variance. In all cases the F value for diag- 
nostic groups was significant. For acceptance of people (ideal vs. people) and ac- 
ceptance of father (ideal vs. father) the mean discrepancy score of the non-schizo- 
phrenic patients falls midway between the mean scores of the normals and the mean 
scores of the schizophrenic patients. This tends to bear out our hypothesis although 
the variability within groups blurs the distinction between adjacent group means. 
For acceptance of self (self vs. ideal) and acceptance of mother (ideal vs. mother) 
both of the patient groups show significantly less acceptance than the normals but 
do not differ from each other. Females show greater acceptance of people and greater 
acceptance of mother than do males. None of the interaction effects between sex and 
diagnosis were significant. 


TABLE 3. DISCREPANCY SCORE DIFFERENCES BETWEEN DIAGNOSTIC AND SEX GROUPS 

Columns: Self vs. Ideal (S - I); Ideal vs. Father (I - Fa); Ideal vs. Mother (I - Mo); 
Ideal vs. People (I - P). 
Rows: M Norm (N = 60); M NScP (N = 30); M ScP (N = 30); F Diagnostic Groups; 
t Norm vs. NScP; t Norm vs. ScP; t NScP vs. ScP; M Males (N = 60); 
M Females (N = 60); F Sex; M Depressive (N = 14); M Paranoid (N = 18); t Dep. vs. Pa. 
[Cell entries are not recoverable from the source scan.] 

*Significant .025 level for comparisons between patients and normals or significant below 
.05 level for comparison between patient groups. All “t” tests are two tailed. 


The contrasts between the patients diagnosed as depressives and the patients 
diagnosed as paranoids can be found in the last three rows of this table. The para- 
noids show less acceptance of people than the depressives, as predicted. The pre- 
diction that the paranoids would show more acceptance of self is not borne out. In 
addition, the paranoids show significantly less acceptance of father than the de- 
pressives. 

On the basis of a study of self-satisfaction in college students, Block and Thomas 
predicted that self-satisfaction (defined in the same way as self-acceptance) would 
be a prognostically poor sign in hospital patients since these patients would be 








330 MARVIN ZUCKERMAN, MARC BAER AND IRWIN MONASHKIN 


“adapted but not adaptable’’. As an incidental result we tested this proposition 
using the hospital rating on outcome of hospitalization. The results are given in 
table 4. In contradiction to the prediction of Block and Thomas our self-satisfied 
group (low self-ideal discrepancy) showed the most improvement while in the hos- 
pital. 


TABLE 4. SELF ACCEPTANCE AND HOSPITAL OUTCOME 

Factors                      Unimproved     Improved +     Total 
                                            Recovered 

Low S - I discrepancy             2             18           20 
Medium S - I discrepancy         10             10           20 
High S - I discrepancy*           6             13           19 
Total                            18             41           59 

chi-square 7.56, df 2, p below .05 


*one patient in this group died before outcome was determined. 
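
The chi-square reported for table 4 can be checked directly from the six cell counts. The sketch below assumes an ordinary Pearson chi-square test of independence without continuity correction (the text does not name the variant used); the function name `chi_square` is illustrative.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of row and column.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Rows: low, medium, high S - I discrepancy.
# Columns: unimproved, improved + recovered.
observed = [[2, 18], [10, 10], [6, 13]]
stat, df = chi_square(observed)

CRITICAL_05_DF2 = 5.991  # tabled chi-square value for p = .05 with 2 df
print(f"chi-square = {stat:.2f}, df = {df}, exceeds .05 value: {stat > CRITICAL_05_DF2}")
```

Most of the statistic comes from the low and medium discrepancy rows; the high discrepancy row is close to its expected counts.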


The data bearing on our last hypothesis are given in table 5. Acceptance of 
people does vary directly with acceptance of parents in both patients and normals. 
Acceptance of self also varies directly with acceptance of parents with the exception 
of the relationship between acceptance of mother and self-acceptance in the patient 
group. 

TABLE 5. CORRELATIONS BETWEEN ACCEPTANCE OF SELF (S - I), 
PARENTS (I - Mo, I - Fa), AND PEOPLE (I - P) 

Factors           Normals (N = 60)     Patients (N = 60) 

S-I x I-Mo          [illegible]             .19 
S-I x I-Fa             .46*                 .40* 
S-I x I-P           [illegible]             .[?]8* 
I-Mo x I-Fa         [illegible]             .50* 
I-Mo x I-P             .44*              [illegible] 
I-Fa x I-P             .40*                 .45* 





*significant below .01 


DISCUSSION 

Self-acceptance does not seem to be an accurate index of adjustment within the 
patient group although the patients as a group are more dissatisfied with themselves 
than normals. Self-acceptance probably is influenced by mechanisms like denial 
which are used by patients of all degrees of pathology; by hysterics as well as para- 
noids, for instance. Self-acceptance does seem to have some prognostic value in that 
it may indicate that repressive defenses are intact enough so that the patient may re- 
integrate during his hospitalization. 

Acceptance of others seems to be a better index of adjustment. At the extreme of 
non-acceptance it may reflect pathological mechanisms like projection. The differ- 
ence between the paranoids and depressives is evidence for the interpretation. Like 
the character in the Steig cartoon, our sickest patients seem to say “people are no 
damn good”’. 

Women showed significantly more acceptance of people than men. Berger 
reports a similar sex difference in acceptance of others. This finding may be a con- 
sequence of role differences. Men in our culture tend to be more competitive and 
aggressive than women. If they are too accepting, or identify too closely with all 
people, they cannot fully maintain this role. 

Acceptance of father seems to relate more directly to adjustment than accept- 
ance of mother although patients on the whole are less accepting of mother than 
normals. Identification, defined as similarity between self and parent descriptions, 
has been shown to relate to adjustment in college students. Sopchak found that 
failure to identify with the father is more clearly associated with trends toward ab- 
normality than is failure to identify with the mother. Arieti points out that the 
child is more likely to repress the ‘‘bad”’ image of the mother than the bad image of 
the father or other people. He does this in order to protect himself against perceiving 
a hopeless situation in which the main source of affection is irrevocably shut off. If 
it is true that the mother plays a crucial role in determining adjustment then we 
must use more subtle techniques to get at this ‘“‘repressed” image, or evaluate the 
mother directly. 

Women show significantly more acceptance of mother than men. This finding 
is understandable in terms of our alternate definition of this score as a measure of 
“identification aspiration”. Women want to be more like mother than men, and 
accept her personality pattern more for this reason. It is not clear why the converse 
was not found, i.e., why men did not show greater acceptance of father than women. 
One may question whether the lack of acceptance of the father, or lack of desire to 
identify with him, is a function of his real personality or a function of the sicker 
patients’ distorted attitudes toward people in general. Does the patient generalize 
from father to people, or from people to father? Our data indicate that there is 
generalization but we cannot determine the direction. The fact that paranoids are 
most extreme in their rejection of father may indicate that the parental rejection may 
be, in part, a function of the projective mechanism. Ultimately this type of question 
will be more adequately probed by coordinated studies of the children and parents. 
We are undertaking such studies at present. 

According to Arieti the child learns to accept himself as a consequence of his 
parents’ acceptance of him, then generalizes his acceptance to parents and to people. 
The intercorrelations between the measures of acceptance in table 5 support the 
possibility of this type of generalization of acceptance. 


SUMMARY 


This study was undertaken to explore the relationships between psychopathology 
and acceptance of self, parents, and people. The subjects were 60 hospitalized psy- 
chiatric patients and 60 normals. Concepts of self, ideal, mother, father, and people 
were measured on self-rating adjective scales. Acceptance scores were defined as the 
discrepancies between the ideal concept and each of the other four concepts. Degree 
of adjustment within the patient group was evaluated by (a) a breakdown of this 
group into schizophrenic and non-schizophrenic groups, (b) a rating of adjustment 
based on the final case summary. A comparison of paranoids and depressives was 
made, on the assumption that these groups represent the extremes of externalization 
and internalization of hostility. 


1. Patients showed less acceptance of self than normals but this measure did 
not relate to degree of adjustment within the total patient group. In female patients, 
acceptance of self was related negatively to adjustment. 

2. Patients showed less acceptance of people than normals. This measure was 
related to adjustment within the patient group. Paranoids showed less acceptance of 
people than depressives. 

3. Patients showed less acceptance of mother than normals but this measure 
did not relate to adjustment within the patient group. 

4. Patients showed less acceptance of father than normals and this measure 
was related to adjustment within the patient group. Paranoids showed less accept- 
ance of father than depressives. 

5. The measures of acceptance of parents correlated with acceptance of people 
in both normal and patient groups. 


The results were discussed with reference to Arieti’s ideas on the development of 
the schizophrenic predisposition. 
INTRODUCTION 


This study of relationships between stimulus structure and ambiguity was 
prompted by problems in personality evaluation using projective techniques. Fre- 
quent responses, such as Popular responses on the Rorschach, usually have limited 
personal reference while infrequent responses are richer in personal meaning for the 
subject. Frequent responses are primarily stimulus-determined and infrequent 
responses are primarily need-determined. Also in a normal population frequent or 
stimulus-determined responses to test materials of limited structure appear to be less 
ambiguous, more definite, to the subject than infrequent responses. In order to 
assess the degree of personal significance of ambiguity of response it is essential to 
rule out stimulus effects. The purpose of this study is to determine if degree of 
ambiguity of response is a significant function of degree of stimulus structure in a 
group of normal subjects. 

Anagrams were used as the stimulus material because of their convenience in 
determining and manipulating structure and ease of scoring accurate solutions. 
Anagram devices have been used in several studies of mental set: 4 ® 7) but they 
have not been studied systematically in regard to their value as a projective stimulus 
technique. A secondary purpose of this study is to obtain some additional informa- 
tion about multiple solution anagrams. 


OPERATIONAL DEFINITIONS 


Stimulus structure and ambiguity may be defined and measured in many ways. 
The definition of stimulus structure ultimately involves the opinion of judges. Fre- 
quency of agreement of judges as to the meaning of a stimulus is our best estimate of 
degree of structure. If there is high agreement among the judges, the stimulus is 
highly structured. If there is little agreement, there is little structure. Degree of 
structure of an anagram is computed from the relationship of the frequency of oc- 
currence of a given solution to the anagram to the number of times this anagram is 
solved. An anagram is considered solved when its letters are rearranged into either 
of the two possible meaningful words for that anagram. For example, anagram 
“irgn” could be solved as either “ring” or “grin”. Should solution “ring” be given 
by 50% of the subjects, and “grin” by 50% of the subjects, anagram “irgn” would be 
structured at the .50 level. Should the solution “ring” be given by 90% of the subjects 
and “grin” by 10%, anagram “irgn” would be structured at the .90 level. In 
this paper, structure always is stated in terms of the more frequent of the two 
solutions. 
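
The structure score defined above can be illustrated with a minimal sketch. The function name, the coding of unsolved attempts as None, and the response counts are assumptions made for illustration only; they are not taken from the study's records.

```python
from collections import Counter


def anagram_structure(solutions):
    """Return (structure, frequent_solution) for one two-solution anagram.

    `solutions` holds one entry per subject: the word given, or None when
    the subject did not solve the anagram. Unsolved attempts are excluded,
    since structure is defined relative to the number of times the anagram
    is solved.
    """
    counts = Counter(word for word in solutions if word is not None)
    solved = sum(counts.values())
    if solved == 0:
        raise ValueError("anagram was never solved; structure is undefined")
    # Structure is always stated in terms of the more frequent solution.
    frequent_solution, frequency = counts.most_common(1)[0]
    return frequency / solved, frequent_solution


# Hypothetical responses to "irgn": 9 subjects give "ring", 1 gives "grin",
# and 2 fail to solve it.
responses = ["ring"] * 9 + ["grin"] + [None] * 2
structure, frequent = anagram_structure(responses)
print(f"{frequent}: structure = {structure:.2f}")
```

With these hypothetical counts the anagram would be structured at the .90 level, since the two unsolved attempts do not enter the denominator.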

Ambiguity, as used in the paper, relates to a perceptual characteristic and is 
thought of as the degree of structure an individual attributes to a given stimulus. 
Ambiguity varies from a clear perception, where the stimulus means only one thing, 
to a more vague perception, where the meaning of the stimulus is more doubtful and 
has more than one possible meaning or even no meaning at all. Degree of ambiguity 
is defined as the frequency with which alternative meanings are given to a stimulus. 
Ambiguity was measured by the frequency of change of solutions from Trial 1 to 
Trial 2 on the anagrams having two possible solutions. Degree of ambiguity could 
be measured in terms of spontaneous changes and/or changes induced by intervening 
variables such as clues. Both methods were utilized in this study. The purpose of 
using clues was to induce changes in performances artificially. Differential changes 
in performance from the clues were then studied with respect to anagram structure. 

Four experimental hypotheses regarding the relationships between ambiguity 
and stimulus structure were made. They are: 

1. Changes from a frequently given response to an infrequently given response, 
occurring spontaneously or from clues, tend to occur more often in less structured 
anagrams than in anagrams of higher structure. 

2. Changes from an infrequently given response to a frequently given response 
should not be related to anagram structure since infrequent responses are assumed 
to be need-determined rather than stimulus-determined. 

3. Frequently chosen responses tend to be maintained more often in highly 
structured anagrams than in less structured anagrams. 

4. Infrequently chosen responses tend to be maintained more often in less 
structured anagrams than in anagrams having higher structure. 


MATERIALS AND METHOD (PRE-EXPERIMENTAL PROCEDURE) 


Two lists of anagrams containing a total of 139 multiple solution anagrams were 
individually administered to 60 pre-experimental subjects. The pre-experimental 
subjects were all 18 years of age or older, had an eighth grade education or better, 
and had never been hospitalized for mental illness. They were heterogeneous in re- 
gard to age, sex, race, intelligence and education. Anagram solutions and time re- 
quired were tabulated for these subjects. On the basis of these results two sets of 
four letter anagrams, each with two possible solutions, were selected. The following 
criteria were used in the selection: 

a. Frequency of solution of the anagram. Anagrams correctly solved by 
90% of the subjects were used. 

b. Stability of structure of anagrams. Anagrams varying greatly in struc- 
ture with the age, sex, race and educational differences were discarded. 

c. Structure of anagrams. Anagrams were paired by matching them as to 
degree of structure. A wide range of structure between anagram pairs was 
desired. 

d. Familiarity of anagram solution words. Frequency of occurrence of an 
anagram solution was compared with the Thorndike word count list which is 
based on frequency of occurrence in printed matter®. No significant relation- 
ship between frequency of anagram solution and frequency of usage of that 
word in printed matter was found. This is consistent with the findings of Rees 
and Israel“. They found that frequency of anagram solution was significantly 
associated with the arrangement of the letters, structure, within each anagram. 


These operations resulted in 20 pairs of anagrams, matched in structure, with 
structure scores ranging from .50 to .99. Two sets of anagrams each containing 20 
anagrams were then established. These are labeled Anagram Set A and Anagram 
Set B. These 40 anagrams selected for investigation were then inserted into a list of 
60 additional anagrams, making a list of 100 anagrams, to reduce practice effects. 
The order of occurrence of the experimental anagrams in the list was determined 
randomly with the restriction that they could not occur in the first 10 anagrams or 
after the 60th anagram in the list. On the basis of the performances of the pre- 
experimental subjects a time limit was set for the list at seven minutes. This was de- 
signed to allow all subjects to complete the first 60 anagrams but not complete all 
100 anagrams. 
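
The placement rule described above (experimental anagrams ordered randomly, but never among the first 10 items or after the 60th) can be sketched as follows. This is an illustrative sketch only: the function name, the seed parameter, the placeholder item labels, and the decision to keep the filler anagrams in their given order are assumptions, since the original randomization procedure is not described in further detail.

```python
import random


def build_anagram_list(experimental, fillers, first_allowed=11,
                       last_allowed=60, seed=None):
    """Place experimental anagrams at random positions within
    first_allowed..last_allowed (1-based) and fill the rest with fillers."""
    total = len(experimental) + len(fillers)
    allowed = range(first_allowed, last_allowed + 1)
    if len(experimental) > len(allowed):
        raise ValueError("not enough allowed positions")
    rng = random.Random(seed)
    # Draw distinct positions for the experimental items.
    positions = rng.sample(allowed, len(experimental))
    shuffled = list(experimental)
    rng.shuffle(shuffled)
    order = [None] * total
    for position, item in zip(positions, shuffled):
        order[position - 1] = item
    # Remaining slots take the fillers in the order supplied (an assumption).
    remaining = iter(fillers)
    for index in range(total):
        if order[index] is None:
            order[index] = next(remaining)
    return order


# Hypothetical labels standing in for the 40 experimental and 60 filler items.
experimental = [f"EXP{i:02d}" for i in range(1, 41)]
fillers = [f"FILL{i:02d}" for i in range(1, 61)]
full_list = build_anagram_list(experimental, fillers, seed=7)
print(full_list[:12])
```

Because 40 experimental items occupy 40 of the 50 positions from 11 through 60, a subject who completes the first 60 anagrams within the time limit has attempted every experimental item.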

Next, clue words were selected for each of the 40 experimental anagrams. The 
clue word for each anagram was always the least frequently given solution to the 
anagram by the pre-experimental group of subjects. For example, if on anagram 
“irgn”, the solution “ring” were given more frequently, then “grin” would be the 
clue word. The clue words were randomly arranged into Clue Word List A for Ana- 
gram Set A and Clue Word List B for Anagram Set B. 


EXPERIMENTAL PROCEDURE 


A total of 68 student nurses were tested as a group. Each subject was presented 
a booklet containing pages in the following order: Directions, the list of 100 anagrams, 
a list of 20 clue words, a page for writing down clue words memorized and finally 
a duplicate list of 100 anagrams. The directions for the first trial of the anagrams 
were as follows: “This is an experiment involving the use of anagrams. Anagrams 
are a series of scrambled letters which need to be rearranged into sensible words. 
For example: 

1. The anagram astk rearranged is task. 

2. The anagram pdese rearranged is speed. 


The object of the experiment is to solve as many anagrams as you can by making 
them into sensible words. Remember to use only the letters given in each anagram. 
Do not add or omit any letters in the anagrams. Some anagrams may have more 
than one solution, but do not worry about that. Just try to solve as many anagrams 
as you can in the time allowed. You will have seven (7) minutes. Do you have any 
questions?” 

After the subjects finished reading the directions along with the experimenter, 
they turned the page and worked for exactly seven minutes solving the 100 anagrams. 
Next, the subjects turned to the page with the 20 clue words and were given the fol- 
lowing directions: “Here is a list of words which are solutions to the anagrams. Mem- 
orize them the best you can during the next three (3) minutes. I want to see if they will 
help you improve your score on the next trial on the anagram list.” The 34 subjects 
receiving odd numbered booklets had Clue Word List A to memorize and were 
labeled Group I. The 34 subjects receiving even numbered booklets had Clue Word 
List B to memorize and were designated Group II. Thus, for Group I Anagram 
Set A served as experimental anagrams and Anagram Set B served as control ana- 
grams. For Group II, Anagram Set B contained the experimental anagrams while 
Anagram Set A was used as a control set. 

Following the three (3) minute period of memorization of the clue words the 
subjects turned to the next page and were allowed two minutes to write down as 
many of the clue words as they could recall. Immediately thereafter the subjects 
turned the page to the second trial of the list of 100 anagrams. The directions were: 
“Solve as many of the anagrams as you can.” They were again allowed seven (7) 
minutes for the second trial of the 100 word anagram list. 


RESULTS 


The degree of structure of each anagram in Anagram List A and Anagram List 
B was computed separately for the performances of Groups I and II using the same 
formula as in the pre-experimental group. On Trial 1 of the Anagram Sets A and B, 
the performances of Groups I and II were similar to each other and to the pre-experimental 
group in terms of range and distribution of anagram structures. The overall 
character of the performances of Groups I and II was also similar on 
Trial 2. There were some divergences between Groups I and II in the structures of 
individual anagrams on Trial 1, but these were no greater than could be expected on a chance 
basis. 


Section I. Table 1 compares the effects of the clue words with the proportion of 
shifts in response from Trial 1 to Trial 2. Separate X² comparisons are made for 
Group I and Group II. The proportion of shifts in response is a comparison of the 
frequency of a particular shift with the number of opportunities for occurrence of 
that shift. For example, the proportion of F₁ - I₂ shifts is the number of shifts from 
frequent responses on Trial 1 to infrequent responses on Trial 2 divided by the number 
of frequent responses on Trial 1. Table 1 indicates that the clues to the infrequent 
solution effectively increased the number of shifts from frequent to infrequent 
responses for both groups. The level of confidence is .02 for Group I and .01 for 
Group II. 


TABLE 1. PROPORTIONS OF SHIFTS IN CLUED AND NON-CLUED CONDITIONS 

                         Group I                                   Group II 
                  Proportions of shifts                     Proportions of shifts 
Types of     Clued       Non-Clued    X² values*      Clued       Non-Clued    X² values* 
Shifts       (Anagram    (Anagram     of              (Anagram    (Anagram     of 
             Set A)      Set B)       difference      Set B)      Set A)       difference 

F₁ - I₂      .138        .071         5.55 (2%)       .206        .058         22.22 (1%) 
I₁ - F₂      .250        .307         .42             .172        .308         4.09 (5%) 
F₁ - F₂      .821        .865         385             .400        .871         2.88 
I₁ - I₂      .696        .634         B85             .808        .608         .82 

F = frequently given solution. 
I = infrequently given solution. 
Subscripts 1 and 2 refer to Trials 1 and 2, respectively. 
*Yates correction for continuity was employed wherever applicable. 
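
The two computations behind Table 1, the proportion of a given shift and the X² test with the Yates correction for continuity, can be sketched as follows. The coding of responses as "F", "I" or None, the function names, and all counts in the example are hypothetical; the cell frequencies underlying the table are not reported, so this is an illustration of the method rather than a reproduction of the analysis.

```python
def shift_proportion(pairs, from_kind, to_kind):
    """Proportion of Trial 1 `from_kind` responses followed by `to_kind`.

    `pairs` is a list of (trial1, trial2) codes, where "F" is the frequent
    solution, "I" the infrequent solution, and None an unsolved anagram.
    The denominator is the number of opportunities for the shift, i.e. the
    number of `from_kind` responses on Trial 1.
    """
    opportunities = [second for first, second in pairs if first == from_kind]
    if not opportunities:
        raise ValueError("no opportunities for this shift")
    return sum(second == to_kind for second in opportunities) / len(opportunities)


def yates_chi_square(a, b, c, d):
    """X² with Yates continuity correction for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    corrected = max(abs(a * d - b * c) - n / 2, 0)
    return n * corrected ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical response pairs for one condition.
pairs = [("F", "I"), ("F", "F"), ("F", "F"), ("I", "F"), ("F", None)]
print(shift_proportion(pairs, "F", "I"))

# Hypothetical 2 x 2 table: rows are clued / non-clued,
# columns are shift / no shift.
print(round(yates_chi_square(10, 20, 20, 10), 2))
```

With 1 degree of freedom, the resulting value would be compared with the usual critical values of 3.84 (.05 level) and 6.63 (.01 level).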





Clues to the infrequent solution effectively (.05 level) decreased the number of 
infrequent to frequent response shifts in Group II. They were not effective in re- 
tarding the shift in Group I. This difference is accounted for by the difference between 
Groups I and II in the frequency with which clue words were memorized. Group II 
learned more clue words (.01 level) than Group I. Clues tended to decrease the stab- 
ility from Trial 1 to Trial 2 of the frequent responses. Conversely, clues tended to 
increase the stability of the infrequent response. Neither of these trends is statistical- 
ly significant. 

Since the trends were similar in Groups I and II and the anagram sets had been 
reversed between the two, the anagram sets are considered equivalent. It appears 
then that, although clues did not alter responses to the anagrams a significant 
amount, they were effective in increasing and/or reducing ambiguity of response to 
the anagrams. 


Section II. The comparison of degree of structure with ambiguity, frequency of 
response shift, was initially made separately for the clue and non-clue conditions 
within Group I and also within Group II. Thus, four analyses were done. Anagrams 
were dichotomized at the median in terms of degree of structure and proportion of 
shifts. Two by two contingency tables were employed using the X² technique as the 
test of significance. These analyses indicated consistent trends among the four 
samples. The degree of statistical significance in each sample was restricted by its 
size. Since the number in each sample was 20 and this is the minimum for this 
statistical method, the four samples were combined to obtain a composite picture of 
the findings. In order to combine the four samples, differences in samples related to 











336 JACK G. WIGGINS 


the effects of clues and to differences between the two groups of subjects had to be 
considered. Deference was given to clue effects and group variations by maintaining 
the medians of structure and proportions of shifts in each of the samples. The overall 
analysis was then made by adding the respective cells of the contingency tables in 
each of the samples. Thus, the composite tables were based on an N of 80. Chi 
Square values were determined for each kind of shift. 
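
The pooling procedure described above can be sketched in a few lines: each sample is dichotomized at its own medians, the resulting two by two tables are added cell by cell, and a single X² is computed on the composite table. Several details are assumptions made for illustration, because the text does not state them: values falling exactly at a median are counted as "low", the X² is computed without a continuity correction, and the structure scores and shift proportions in the example are invented.

```python
from statistics import median


def median_split_table(structure, shifts):
    """2 x 2 table of counts: rows = low/high structure, cols = low/high shifts.

    Each variable is split at the median of this sample only, which keeps
    group and clue effects from distorting the pooled table.
    """
    structure_median = median(structure)
    shift_median = median(shifts)
    table = [[0, 0], [0, 0]]
    for s, p in zip(structure, shifts):
        table[int(s > structure_median)][int(p > shift_median)] += 1
    return table


def pool_tables(tables):
    """Add the corresponding cells of several 2 x 2 tables."""
    pooled = [[0, 0], [0, 0]]
    for table in tables:
        for i in range(2):
            for j in range(2):
                pooled[i][j] += table[i][j]
    return pooled


def chi_square_2x2(table):
    """Uncorrected X² for a 2 x 2 table (1 degree of freedom)."""
    (a, b), (c, d) = table
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical sample of four anagrams; the study used 20 per sample.
structure = [0.50, 0.60, 0.70, 0.80]
shifts = [0.40, 0.30, 0.20, 0.10]
sample_table = median_split_table(structure, shifts)
composite = pool_tables([sample_table] * 4)  # four samples pooled
print(composite, chi_square_2x2(composite))
```

Splitting each sample at its own medians before adding the cells is what allows the four samples to be combined into a composite table with an N of 80.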

A significant relationship (X² = 18.05) beyond the .01 level of confidence was 
found between stimulus structure and frequent to infrequent shifts. This confirmed 
Hypothesis 1 that changes from a frequently given response to an infrequently given 
response, occurring spontaneously or from clues, tend to occur more often in less 
structured anagrams than in anagrams of higher structure. No significant relationship 
(X² = .80) between anagram structure and infrequent to frequent response 
shifts was found. This result supports Hypothesis 2 that changes from an in- 
frequently given response to a frequently given response should not be related to 
anagram structure. This indicates that infrequent responses to the anagrams are not 
primarily stimulus-determined, hence, are probably need-determined responses. 

Frequent responses were found to be significantly more stable at the .01 level 
of confidence (X² = 8.45) in the anagrams of higher structure. This confirms Hypothesis 
3 that frequently chosen responses tend to be maintained more often in 
highly structured anagrams. Infrequent responses were more stable at the .01 level 
of confidence (X² = 9.81) in anagrams of lower structure. This supports Hypothesis 
4 that infrequently chosen responses tend to be maintained more often in less 
structured anagrams than in anagrams of higher structure. 


DISCUSSION 


The results of this study will be discussed in terms of broader implications cog- 
nizant of the danger of overgeneralization. The major finding is that ambiguity of 
response to anagrams is significantly related to degree of anagram structure. The 
exception to this is when an individualized (infrequent) response is changed to a con- 
ventional (frequent) response. This change is not related to the degree of anagram 
structure. It is assumed that this is a function of the strength of personal need which 
is independent of stimulus structure. 

An implication of these findings might be that, when evaluating ambiguities in 
form quality on the Rorschach by various proposed bases, : ?: * ®) it is necessary to 
consider the degree of structure of the blot or “card pull”. In order to assess more 
precisely the degree of individualized ambiguous reaction, the effects of “card pull” 
for various blot locations might be established by extending Beck’s“? method of de- 
termining normal details. 

Clues were found to be effective in increasing or reducing ambiguity as meas- 
ured by changes of response. Since the relationships between structure and ambig- 
uity were essentially the same in the clue and the control conditions, it would appear 
that there was little differential effect of stimulus structure on the influence of the 
clues. For example, it might be hypothesized that examiner influence upon a projective 
test is approximately the same regardless of degree of “card pull”. 

Another implication is that multiple solution anagrams may have some applica- 
tion as a projective screening device. Further investigation of anagrams is planned. 


SUMMARY AND CONCLUSIONS 


The relationship between stimulus structure and ambiguity was studied using 
anagrams with two possible solutions. The effect of clues upon ambiguity was also 
investigated. It was found that: 

1. As stimulus structure increased, ambiguity of response tended to decrease. 

2. When responses were not stimulus oriented, the degree of ambiguity was 
not related to the degree of stimulus structure. 


3. Clues influenced the degree of ambiguity of response but did not significantly 
alter the overall stability of performance. 
SHIFTS IN THE LEVELS OF OPERATING DEFENSES INDUCED BY 
BLURRING OF INKBLOT SLIDES¹ 
ANTHONY R. KRAUS 
New Hampshire State Hospital, Concord, N. H. 


PROBLEM 

Widely accepted Rorschach theory states that the perceptually unstructured 
inkblots create “anxiety” and subsequently activate the subject’s defensive efforts 
‘. 10). The purpose of this experiment is to study the changes brought about in the 
defensive struggle by rendering the “unstructured” stimulus still more unstructured. 
The hypothesis was that the blurred exposure of inkblot slides might activate deeper 
unconscious processes. The objective increase of difficulties in articulation of the 
stimulus material might increase the amount of anxiety and lead to the mobilization 
of different defensive operations “>. 


PROCEDURE 

The subjects of Group A were a class of 57 Affiliate Student Nurses at the New 
Hampshire State Hospital. All were female, high school graduates with at least one 
year of nursing training with ages in the late teens and early twenties. The class had 
been previously exposed to the Harrower Multiple Choice form of the Rorschach 
plates. 

The projector used was the American Optical Company’s Delineascope, Model 
V. This instrument’s focusing lens in this experiment was 21'2” from the screen. 
The three slides of the Zulliger inkblot test used in this experiment represent a condensed, 
though representative and comparable short form of the Rorschach inkblots 
“*). Slide I is dark in tone, black with different shades of gray and white. It is 
constructed to combine the qualities of Rorschach Cards I, IV, V, and VI. Slide II 
is all colored, comparable to Rorschach Cards VIII, IX, and X. Slide III is black 
and red with rather apparent human figures, representing the stimulus value of 
Rorschach Cards II, III, and VII. 


¹Department of Psychiatric Research, New Hampshire State Hospital. The writer wishes to 
express his appreciation to Professor George M. Haslerud, University of New Hampshire, the Re- 
search Consultant to the New Hampshire State Hospital; grateful acknowledgments for assistance are 
extended to Mr. Walter Steinman, psychological intern, who made the statistical analyses, and to 
Mr. Lars V. Henrikson, psychologist, who provided the initial idea for the experiment and admin- 
istered the slides. 
The class was divided into two groups by having alternating columns of 27 
individuals leave the classroom. Those remaining 30 constituted the first group. 
The three stimuli were consecutively presented to the first group in maximally blur- 
red and then in clear focus. The first group was dismissed and the second group 
took their seats, sitting one seat apart as did the first group. The second group then 
was presented with the three stimuli consecutively in minimally blurred and then 
in clear focus. Maximal blurring was effected by moving the focusing lens outward 
44 millimeters from the clear position. Minimal blurring was effected by moving the 
focusing lens outward 32 millimeters from the clear position. 

Each student of each group was given six blank sheets of paper on each of which 
she recorded her name and stimuli-identifying numerals 1 through 6; 1, 2 and 3 in- 
dicating the blurred exposures and 4, 5 and 6 the clearly focused exposures. The 
examiner read the following instructions: “Sometime ago I showed you a Multiple 
Choice Form of the Inkblot Test. Today, I am going to show you a free choice form 
of the Inkblot Test, in which YOU make up the answers. You will see on the screen 
six inkblot pictures. Your task is to write down what these inkblots, or any parts of 
them, resemble or look like to you. You will see each inkblot for three minutes.” 
Each paper was collected before the next slide was exposed. 

In order to check on the validity of the findings of this experiment it was re- 
peated with another class of Affiliates who had not been before exposed to the Multi- 
ple Choice Rorschach slides. This second group, Group B, consisted of 54 Student 
Nurses. The experimental procedure was strictly duplicated. In the instructions, of 
course, the reference to the Multiple Choice Test was omitted. The minimum blurred 
exposition was abandoned in the second experiment because it did not yield any 
specific results other than trends in the direction of the maximum blurred exposition. 


RESULTS 
The statistical analysis of the results of both experiments is summarized in 
Tables 1-4. The blurred exposition of the same stimulus elicited on all three slides 
a significantly lower number of responses (p < .01), see Table 1. 


TABLE 1. COMPARISON OF NUMBER OF RESPONSES (R) GIVEN TO THE BLURRED AND CLEAR 
PRESENTATIONS OF THE STIMULUS SLIDES, FOR EXPERIMENT A (N = 30) AND EXPERIMENT B (N = 54) 

                         R                  Mean                 S.D. 
Slide     Exp.     Blurred   Clear     Blurred   Clear     Blurred   Clear 

I         A          78       110       2.60      3.45      3.27      .68 
I         B          99       124       1.85      2.48      1.22      84 
II        A          76       100       2.28      3.33      1.20      1.68 
II        B         113       135       2.        2.        15        1.37 
III       A          86       112       2.8       3.73      43        1.24 
III       B         115       162       2.1%      2.98      97        62 
Total     A         240       322       :         10.70     78 
Total     B         327       434       ).16      8.23      2.97 

*p < .01. The significance of all p-values was determined with 1 df. 
**.01 < p < .02 


TABLE 2. COMPARISON OF THE TOTAL NUMBER OF VISTA RESPONSES ELICITED ON THE BLURRED 
VS. CLEAR PRESENTATIONS 

                  Blurred                  Clear 
Exp.       Non-Vista   Vista        Non-Vista   Vista         X² 

A             190        50            319         3        1161.68 
B             233        94            416        18         525.58 

The significance of all p-values was determined with 1 df. 
TABLE 3. COMPARISON OF THE TOTAL NUMBER OF ANIMAL RESPONSES ELICITED ON THE BLURRED 
VS. CLEAR PRESENTATIONS 

                                Blurred                    Clear 
Slides        Exp.       Non-Animal   Animal        Non-Animal   Animal         X² 

I, II, III    A             223         17             197         2!         103.68 
I, II, III    B             316         11             242         9%         219.50 

The significance of all p-values was determined with 1 df. 


There is a statistically significant increase in the number of Vista responses in 
the Blurred presentation (Table 2, p <.01), resulting in a change of content. While 
the clear picture yielded the usual high percentage of Animal responses (in Experiment 
A 50%, in Experiment B 62%, of the total number of responses were Animals), 
this type of response was nearly eliminated in the Blurred exposition. The Chi- 
Square test was applied to determine the operation of a factor other than chance in 
the eliciting of fewer animal responses and yielded results beyond the .01 level of 
confidence (Table 3). The observed change from Animal content to practically no 
Animal content is of special interest. The Animal category has been found to be the 
most stable variable in other studies with experimentally induced variations in the 
Rorschach performance “?. 

Especially in the blurred presentation of Slide I, the animal content was replaced 
by the category “house,” which is a relatively rare content in the Rorschach 
situation, and Zulliger“*?’ does not even mention this content category in his enumer- 
ation of responses elicited in his experiments. In Group A, 47 per cent and in Group 
B, 40 per cent of the total number of responses were “houses”, with a wide variety 
of detailed elaborations. 

The all-colored Slide II elicited an increased emphasis on Color reactions in the 
blurred presentation: The experimental Group A gave 22 color-determined responses 
on the clear exposition while the blurred presentation of the same stimulus slide 
elicited 40 color-determined reactions. Similarly, experimental Group B gave 29 
on the clear, 47 on the blurred exposition. The difference between the observed and 
expected frequencies for both groups, as determined by the Chi-Square test (X² = 45, 
60), is significant beyond the .01 level of confidence. Furthermore, a statistically 
significant increase in the use of “red” as a determinant was observed in the Blurred 
presentation (Experiment A, Slide II: Clear, 9% of the total number of responses, 
Blurred, 28%; Experiment B, Clear, 7%, Blurred, 28%; X² for A, 40.09, for B, 
44.29; with 1 df, p < .01). The percentage of “green” as determinant showed a 
statistically insignificant increase (Experiment A, Clear, 10%, Blurred, 12%; Experiment 
B, Clear, 10%, Blurred, 17%). Qualitative analysis of the distribution of the 
color reactions reveals a pronounced shift from primarily form-determined color 
responses (FC) in the Clear exposition to primarily color-determined (CF) and increasingly 
cruder (C) color interpretations in the Blurred presentation. While the 
Clear exposition elicited responses like “green bushes”, “trees”, “leaves”, “colored 
flowers”, “red objects”, “Chinese painting,” we find in the Blurred version “rainbow”, 
“sunset”, “explosion”, “fire”, “bright colors”, “mess of blurred colors”, “lipstick 
smear”. 

Slide III lends itself readily to the production of Movement responses. Although 
there is a reduction of total output there is an increase in the number of M 
responses in the blurred exposition. Qualitative analysis of the Movement responses 
reveals a shift of emphasis from “active” toward “inhibited” and blocked movements, 
while the number of inactive or static M’s remains relatively constant. Examples 
of “inactive” M’s are human figures described as “standing”, “sitting”, 
“talking”, “looking”, “leaning”, etc.; “active” M’s are dancing, marching, skating, 
walking, running, jumping, fighting, etc. The category of “blocked” M’s comprises 
activities described as “about to move away”, “trying to lift”, “trying to pull apart”, 
“tugging”, “balancing”, etc. The Chi-Square test was applied to determine the 
significance of the differences between the observed and expected frequencies of the 
three Movement categories in the Blurred presentation of Slide III, and yielded the 
following results: for the Blocked: X² = 25.00; for the Active: X² = 19.83, both 
significant beyond the .01 level (1 df); for the static M: X² = 1.69, not significant. 


TABLE 4. COMPARISON OF THE PERCENTAGES OF MOVEMENT RESPONSES GIVEN 
TO THE BLURRED (N = 100) AND CLEAR (N = 101) PRESENTATIONS OF SLIDE III 
IN EXPERIMENTS A AND B 


Exper. 
Slide A&B Static M Blocked M Active M 





III Clear 18% 10% 


Blurred 25% 





DISCUSSION 


The hypotheses that blurring of inkblot slides increases manifestations of anx- 
iety and produces a shift in the modes of defenses are both supported by the results 
of this experiment. The drastic reduction of output as well as the increase in Vista 
responses—which certainly, in part, depend on the spatial configuration of the stimu- 
lus material—are commonly accepted as signs of increased anxiety. @ * ©) Similarly, 
the observed shift in the Movement responses from a predominance of “active” to 
an increase of “blocked” movements seems to indicate that the blurred stimulus 
Slide III induces an element of uncertainty and insecurity. Responses such as “trying to 
lift”, “trying to pull apart”, “tugging”, certainly manifest a sense of strain, an inhibition 
of acting independently and aggressively. While the responses on the clear 
exposure are weighted on the side of active role-taking, the responses on the blurred 
presentations indicate indecisiveness, hence increased anxiety. 

While the clear exposure of the inkblot slides yielded the usual high percentage 
of animal responses which are generally interpreted as primarily structural defenses 
and an index of stereotypy and adaptation’, the blurred presentation of the same 
stimulus material elicited practically no animal content. Instead, we obtained a 
large percentage of “house” responses. “Houses” are found in the Rorschach records 
of the youngest group of children testable (2-314 years, “); in the records of adults, 
this content is almost exclusively encountered as interpretation of the darker clasp 
at the middle detail of Card VII (Beck’s D6). The content ‘house’ seems to sym- 
bolize the need for security and shelter“. Apparently we observe a shifting of de- 
fenses to a deeper, less intellectual or rational level than expressed in the “animal” 
responses. 

Additional support for the assumption of a shifting of defenses under the im- 
pact of the blurred stimulus comes from the examination of the Color reactions. 
The increase in Color reactions and the obvious shift to Color-dominated responses 
in the blurred exposure may also be interpreted as a weakening of rational, repressive 
control. Instinctual needs and tensions manifest themselves on a more primitive 
level of affective display®®. A similar interpretation is pertinent to the shift from 
the “cool” color, “green,” to the “warm” color, “red”: The color green is identified 
with restraint and control while red represents “earthy’’ “ emotionality. 


SUMMARY 


Psychoanalytically oriented Rorschach theory assumes that the perceptually 
unstructured inkblots activate the subject’s defenses against rising anxiety. This 
experiment was undertaken to test the hypothesis that the blurred exposure of inkblot 
slides might increase the amount of anxiety and lead to the mobilization of 
different defensive operations. Two groups of Affiliate Student Nurses were present- 
ed with clear and blurred exposures of Zulliger’s inkblot slides. The results of this 
study show significant differences between the reactions to the blurred and clear 
exposition of the same stimulus. 
The blurred exposition, as compared with the clear one, yielded the following 
findings: 
1. A decrease of output. 
2. An increase of Vista responses. 
3. The content “animal” was greatly reduced, and, especially on Slide I, the content 
“house” emerged. 
4. An increased emphasis on Color reactions with a shift from FC to CF and C. A 
significant increase in the use of “red” as a determinant was observed. 
5. Despite the decreased production, there is a relative increase in M responses with 
a pronounced shift from active to blocked Movement responses. 


The results, supporting the original hypothesis, are tentatively interpreted as 
signs of increased anxiety, abandonment of more conventional rational defenses, 
and shifts in the operating defenses to more primitive levels of psychic functioning. 
SUGGESTIBILITY, SOCIAL CLASS AND THE ACCEPTANCE OF 
PSYCHOTHERAPY* 
STANLEY D. IMBER, JEROME D. FRANK, LESTER H. GLIEDMAN, 
EARL H. NASH AND ANTHONY R. STONE 
Johns Hopkins University School of Medicine 


PROBLEM 

Both practical and theoretical considerations have brought about a recent in- 
creased interest in identifying variables which influence patients to accept psychiatric 
treatment. The relative scarcity of trained psychotherapists in proportion to the 
large number of patients seeking their services indicates the desirability of therapists 
expending their efforts and time more economically. Those patients who leave treat- 
ment very early without apparent benefit (and without the therapist’s approval) in 
particular are sources of frustration both to themselves and their therapists. One 

*This study is part of a larger research project sponsored by the U. S. P. H. S. under grant M-532 
(C-2), titled “A Study of Attributes of Psychiatric Out Patients Determining their Responsiveness to 
Group and Individual Psychotherapy,” Jerome D. Frank, M. D., Responsible Investigator. The 
authors wish to acknowledge the participation of Morris B. Parloff, Ph.D. (now at the National Institute 
of Mental Health) in the planning of this study. 
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variable which has been found to contribute significantly to premature treatment 
termination is social class“: * 5 7, An earlier study by the authors demonstrated that 
middle class patients remain in treatment longer than lower class individuals, 
This appeared to confirm certain hypotheses advanced by Auld, Myers, and Schafer 
(, 4, 6) who ascribe similar results to differences in expectancies and difficulties in 
communication between therapists and patients who are members of different social 
classes. 

Another variable which conceivably bears on whether a patient accepts treat- 
ment can be subsumed under the general term “suggestibility”. The psychiatric 
patient, like all patients, comes to an authority, the doctor, who recommends a 
specific course (of treatment) to alleviate his illness. One trait which might be ex- 
pected to predispose the patient to a ready acceptance of this expert recommendation 
is the extent to which he is accessible to suggestion or to authority influence, a var- 
iable which probably cuts across social class lines. The purpose of this study was 
twofold: 1. To determine the effect of suggestibility on the acceptance of psycho- 
therapy. 2. To examine the relationship between suggestibility and social class. 


PROCEDURE 

The patient source was the Out-patient Department of the Henry Phipps Psy- 
chiatric Clinic, Johns Hopkins Hospital, a training institution and community clinic 
offering various types of psychotherapy. A total of 57 patients who met the quali- 
fications specified below were included in the study. All patients were between the 
age of 18 and 55 years and their diagnoses were either psychoneurosis or some form 
of personality disorder. Specifically excluded were patients with a diagnosis of 
organic brain disease, anti-social character disorder, alcoholism, overt psychosis, 
or mental deficiency. Prior to treatment assignment, patients were interviewed 
at length by a research psychiatrist. Near the conclusion of the interview each 
patient was administered a sway test,¹ a measure of suggestibility which has been 
used for the prediction of susceptibility to hypnosis.' The test procedure consists of 
blindfolding the patient, suggesting repetitively in a monotone that he (the patient) 
is falling forward, and then measuring the extent (in inches) which the patient actually 
sways beyond his “normal” sway.² Because the instrument for measuring the exact 
extent of sway was found to be unreliable, the readings were dichotomized in terms 
of “sway” or “no sway” rather than reported in number of inches swayed. 

During the interview the patient was told by the psychiatrist that he would be 
assigned to psychotherapy since it was his expert opinion that the patient’s symptoms 
were such that he would derive the greatest benefits from that particular form of 
treatment. The patient was also instructed that he should expect to remain in treat- 
ment for at least six months since that is generally the minimum time required for 
satisfactory results. 

The therapists were three senior residents of the Phipps House Staff. Each was 
given a roster of patients to be treated in group or individual psychotherapy. None 
of the therapists had any choice in his patients, who were assigned at random by 
the research staff. 

A social class rating was determined for each patient, applying Warner’s Index 
of Status Characteristics, based upon occupation, source of income, and house 
type ©. 

RESULTS 

Table 1 indicates that most patients who remained in treatment beyond the 
third interview tended to sway while those who terminated before that point tended 
not to sway. Of the 36 patients who remained beyond the third interview, 24 responded 
positively to the suggestibility test administered prior to their entering treatment. 
On the other hand, 14 of the 21 patients who attended three or fewer interviews 
did not respond to the test. The probability of the chi-square obtained for the 
frequencies reported in Table 1 is less than .05. The significance of this relationship, 
however, appears to be largely a function of the tendency of swayers to remain in 
treatment. Twenty-four (77%) of them did so. In contrast, the non-swayers did not 
demonstrate as consistent a tendency to terminate early. Only 14 (54%) of the 26 
non-swayers left treatment early. 

¹A number of other test instruments were also administered to the patient although they are not 
reported here. 

²Prior to making the sway suggestion the psychiatrist observed the amount which the patient 
swayed while blindfolded but without suggestion. 

TABLE 1. SWAY AND ATTENDANCE FOR PATIENTS IN INDIVIDUAL AND GROUP 
PSYCHOTHERAPY (N = 57) 

                              Sway                 No Sway 
Number of Interviews    Number  Per Cent     Number  Per Cent 

0 to 3                     7      22.6         14      53.8 
4 or more                 24      77.4         12      46.2 

Total                     31     100.0         26     100.0 

X² = 4.67 
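
The reported chi-square can be checked with a short Python sketch. It assumes the cell frequencies implied by the text (7 and 24 swayers, 14 and 12 non-swayers, for 0 to 3 and for 4 or more interviews respectively) and assumes that Yates' continuity correction was applied, which the article does not state. The function name is illustrative only.

```python
import math

def yates_chi_square(a, b, c, d):
    """Chi-square for the 2x2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    # Reduce |ad - bc| by n/2, but never below zero.
    diff = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * diff ** 2 / (row1 * row2 * col1 * col2)

# Rows: sway, no sway. Columns: 0 to 3 interviews, 4 or more interviews.
chi2 = yates_chi_square(7, 24, 14, 12)
# Upper-tail probability of chi-square with 1 degree of freedom.
p = math.erfc(math.sqrt(chi2 / 2))
print(round(chi2, 2), p < 0.05)
```

Under these assumptions the sketch gives a value that rounds to 4.67, with a probability below .05, in agreement with the text.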


A tetrachoric correlation was computed between class status (middle or lower) 
and sway score (“sway” or “no sway”) for all patients. The correlation was found to 
be +.16, indicating that class membership and suggestibility are relatively independent 
variables. Both, however, are related to a patient’s acceptance of treatment.³ 
Hence, it seemed plausible that together they might provide a crude screening device 
which would be more efficient than either alone in discriminating the “stay” 
from the early “dropout” patients. 

Since both middle class membership and a tendency to sway appear to be re- 
lated to staying, those patients possessing the two characteristics simultaneously 
would seem to be the most likely stay candidates. In the present sample 11 patients 
were middle class swayers and all but one actually did remain in treatment, the 
single exception attending three meetings and shortly thereafter entering treatment 
with a private psychiatrist. Conversely, those patients who were both lower class 
and non-swayers might be considered the least likely to stay. There were 19 patients 
in this category and 12 of them terminated early. 


DISCUSSION 


The extent to which a patient accepts psychotherapy is a function of many 
factors in addition to his ostensible need and expressed desire for it. If he is responsive 
to suggestion and, in particular, susceptible to authority influence, the likelihood is 
that he will accept psychotherapy as recommended by the professional expert, the 
doctor. In addition to the individual influence of the doctor, the prestige value of the 
medical setting and the general reputation of the clinic in the community contribute 
to the overall effect on the suggestible individual. All these factors probably were 
operating in the present study where patients came to a clinic in a large community 
hospital of repute and were examined at length by a doctor who prescribed a specific 
course of treatment, psychotherapy. Under such conditions a large proportion of 
suggestible individuals readily accepted the treatment. 

If, in addition to being suggestible, a patient was also a middle class member, 
the probability of his acceptance of psychotherapy was greatly increased. The middle 
class patient frequently is found to have some sophisticated understanding of the 
methods and goals of psychotherapy. His expectations and anticipations are, therefore, 
likely to be met. Also, his interests and background are such that he ordinarily 
experiences no serious difficulty in communicating with a therapist whose social class 
is not dissimilar from his own. Typically, therefore, the middle class patient is 
predisposed toward the acceptance of psychotherapy even before he arrives at the 
clinic. His positive attitude will obviously be reinforced if he happens to be suggestible 
and finds an authority figure recommending a course of action to which he 
already is positively attracted. Middle class patients appear to be no more suggestible 
than lower class ones but those that are tend to be highly receptive to psychotherapy 
as a form of treatment. 

³In the earlier paper that reported the social class study (3), a slightly different cut-off point (4 
or more interviews) was used, and three additional patients were included. A recomputation of the 
data on class and attendance using the present cut-off point and present population produced results 
similar to those noted in the cited study. 

Lower class patients, on the other hand, are much more likely to come to the 
clinic either unaware of the essentials of psychotherapy or with a distorted concep- 
tion of it. Many cannot easily tolerate a non-medical type of treatment with its 
implication that their complaints may not be “real” since they are not treated on a 
physical basis. Furthermore, because of their different expectancies and social class 
background they are likely to find communication with the therapist a difficult mat- 
ter. Hence the lower class patient is less often favorably disposed toward psycho- 
therapy. If, in addition, he is not accessible to authority influence, the possibility of 
his terminating psychotherapy early is quite large. 

Although a suggestible patient, regardless of his class, is likely to accept psychotherapy, 
it does not necessarily follow that a non-suggestible individual will reject 
it. In the present study the non-swayer was almost as likely to remain as he was to 
terminate early. Apparently the decision to reject treatment depends on the pre- 
sence of other factors, such as lower class membership, in addition to non-suggesti- 
bility. In any event, the sole fact that a patient does not sway permits no prediction 
concerning his acceptance of treatment. 

In this connection, it must be pointed out that the absence of a positive response 
to a test of suggestibility does not imply the presence of counter-suggestibility, 
the characteristic we might expect to bear a relationship to non-acceptance of 
psychotherapy, under the conditions of the present study. The non-swayer may or may not 
be counter-suggestible but the sway test as utilized here does not permit the 
identification of this oppositional or negativistic characteristic. 


SUMMARY 

Using the Sway Test as a measure of suggestibility it was found that patients 
who sway tend to remain in psychotherapy. It was postulated that suggestible 
patients were influenced by the authority of the doctor in his role as expert advisor, 
the general prestige of the medical setting and the reputation of the hospital in the 
community. No difference in tendency to sway was found between middle and lower 
class patients but middle class patients who did sway seemed to be most responsive 
to psychotherapy, in terms of stay-rate. Lower class patients who did not sway 
tended to reject treatment. 
PROBLEM 


An important need, in terms of most efficient utilization of mental hygiene 
clinic services, is a method for initially identifying patients who, for various reasons, 
will terminate treatment after only a very few visits and without the consent of the 
therapist. In very many instances this termination actually occurs immediately 
after the clinic intake procedure has been completed and before the therapist has 
even had an initial interview with the patient. Various surveys on VA clinic patients 
(6, 6 10, 11) show that anywhere from 30% to 60% of the patients referred to a VA 
mental hygiene clinic for treatment will terminate, without consent of the therapist, 
within the first six treatment visits. 

A number of studies have been concerned with this problem of premature termination’: 4 ® 7 8 1). 
One of the most comprehensive is Hiler’s study of over 300 
veterans who applied or were referred for treatment at the Detroit VA Mental Hy- 
giene Clinic“. Hiler carefully set up an original and cross-validation sample using 
two extreme groups: (a) those who terminated within five sessions and (b) those 
who remained twenty or more sessions, and found various indices on the Rorschach, 
Wechsler-Bellevue, and Sentence-Completion Test which successfully discriminated 
the two groups in about two-thirds of the cases. Socio-economic status and education 
also differentiated the two groups. The purpose of this present study, part of a larger 
investigation on evaluation of psychotherapy, was to further identify specific 
personality variables and/or socio-economic variables which differentiate individuals who 
remain in treatment in VA outpatient clinics from those who terminate prematurely. 


METHOD 


The sample consisted of 128 veteran patients from nine different VA mental 
hygiene clinics² throughout the country. These veterans were all male, nonpsychotic 
patients who had been accepted for intensive treatment. They had not been hospital- 
ized within the past year for neuropsychiatric illness and had not previously been 
in treatment at a mental hygiene clinic. This sample was selected from a larger 
group and consisted of two extreme groups in terms of length of treatment. One 
group of 60 patients had been in treatment for at least six months, or a total of 26 
visits, since most patients are seen on a once a week basis. These constituted the 
“Remainers”. The other group of 68 patients had been in treatment five or fewer 
visits and then terminated treatment against or without advice of the therapist. If 
treatment was considered to be completed within this time, or if the veteran moved 
to another city in this period he was not included in this group. This is the 
“Terminator” group. The measures used in the study were: 

(1) A 50 item personality inventory, taken from a longer inventory by Applezweig 
and Dibner³, devised to measure so-called psychopathic personality characteristics. 
It is a true-false personal inventory including items of personal 
history and items on individual attitudes. The items included in our inventory 
emphasized nomadism and other traits commonly considered descriptive of the 
“psychopath” such as lack of real personal ties, lack of impulse control, restlessness, 
aggressive acting-out and frequent trouble with the law, lack of ethical 
standards, hostility toward authority, and lack of goal persistence. We call this 
measure Personal Inventory (A). 

(2) A 25 item scale taken from the Adorno, Levinson F scale“. In terms of 
our own study, its subcategories, conventionalism, authoritarian submission, 
authoritarian aggression, stereotypy, and projectivity seemed important dimensions 
to measure. We call this scale Personal Inventory (B). 

(3) A self-rating scale consisting of a two-part 21 item five-point graphic 
rating scale. Part I is a rating of the actual self. Part II is a rating of the 
individual as he would like to be on the same 21 items. These scales were developed 
partially from a study by Brownfain®?, and partially from our own work 
with the MSRPP®. The items cover such areas as intelligence, interpersonal 
relations, personal standards, and estimate of personality adjustment. The 
scales yield three sets of scores: a self-rating score, an ideal self-rating score, and 
a “discrepancy score” or difference score between self-rating and ideal self-rating. 

(4) The 15 word vocabulary test selected by Thorndike from Form L of the 
Revised Stanford-Binet. 

(5) Specific socio-economic and personal items about the patient, including: 
(a) age, (b) years of education, (c) a three-point scale of occupational status, 
(d) religion, (e) marital status, and (f) percentage of disability according to VA 
compensation. 

¹From the Veterans Administration, Veterans Benefits Office, Wash., D. C. The authors gratefully 
acknowledge the assistance of Elizabeth Turk and Betty Shanks with the statistical computations. 

²We are indebted to the VA Mental Hygiene Clinics and the respective clinic staffs in the following 
cities for cooperating in the collection of this data: Baltimore, Chicago, Denver, Detroit, Los Angeles, 
Newark, St. Louis, St. Paul, San Francisco, and Washington, D. C. 

³Applezweig, M. H. Personal communication. 1950. 


The method of analysis was the double cross-validation, in accord with which 
the two groups were further subdivided into four subsamples, two sets of 30 remainers 
and 34 terminators each. These were formed by taking half the Terminator 
patients and half the Remainer patients respectively from each clinic and placing 
them alternately into subsample A and subsample B. In addition, the attempt 
was made to assign for any one clinic, an equal number of a particular therapist’s 
patients to each of the two subsamples, both in terms of Remainer and Terminator 
patients respectively. Thus, each clinic and each therapist had almost equivalent 
representation in subsamples A and B. 
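
The alternating assignment described above can be illustrated with a minimal Python sketch. The roster, the field names (`id`, `clinic`, `group`) and the function name are hypothetical, and the sketch balances only by clinic and group; the additional balancing by therapist mentioned in the text is not attempted here.

```python
from collections import defaultdict

def alternate_split(patients):
    """Assign patients alternately to subsample 'A' or 'B' within each
    (clinic, group) stratum, in the order in which they are listed."""
    counters = defaultdict(int)
    assignment = {}
    for p in patients:
        key = (p["clinic"], p["group"])
        assignment[p["id"]] = "A" if counters[key] % 2 == 0 else "B"
        counters[key] += 1
    return assignment

# Hypothetical roster: two clinics, four patients each.
patients = [
    {"id": i,
     "clinic": "X" if i < 4 else "Y",
     "group": "Remainer" if i % 2 == 0 else "Terminator"}
    for i in range(8)
]
assignment = alternate_split(patients)
in_a = sorted(pid for pid, s in assignment.items() if s == "A")
in_b = sorted(pid for pid, s in assignment.items() if s == "B")
print(in_a, in_b)
```

Each clinic then contributes one Remainer and one Terminator to each subsample, which is the kind of near-equal representation the procedure aims at.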


RESULTS 


The Terminator and Remainer groups were compared independently for sub- 
sample A and B. Two of the original measures, namely Personal Inventory (B) and 
vocabulary, differentiate the Terminator and Remainer groups at .05 levels of con- 
fidence or better. In addition, all the remaining measures, except age, show con- 
sistently similar trends for both subsamples, A and B. Chi-square values on religion 
and marital status showed no significant differences between the groups. 

The various measures, again independently for subsamples A and B, were then 
examined individually. Six of the measures, Personal Inventory (A), Personal 
Inventory (B), Self-rating Discrepancy score, Vocabulary, Education, and Occupation, 
offered the most possibility for refinement as discriminating indices. Vocabulary, 
Education, and Occupation, because of their already well-established interrelation, 
were combined into a single index. Item analyses were done for the other three 
measures and discriminating items were combined. In most cases Chi-square was 
used, with Yates’ correction if Fe < 5, and items at P < .20 were retained. Where 
Chi-square was not appropriate, t tests were used. Again, all items with values of 
P < .20 were retained. The results of these item analyses in terms of discriminating 
efficiency of the combined items are shown in Table 1. Table 2 presents the results of 
cross-validating the four measures. 
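
The item-retention rule can be sketched in Python as follows. The sketch assumes that each item is reduced to a 2 x 2 table (group by response), reads "Fe < 5" as "any expected cell frequency below 5", and uses invented item counts for 34 terminators and 30 remainers; none of these details is given in the article, and the function names are illustrative.

```python
import math

def item_chi_square(table):
    """table = [[a, b], [c, d]]: rows are groups, columns are responses.
    Returns (chi2, p) for 1 degree of freedom, applying Yates' correction
    when any expected frequency is below 5."""
    (a, b), (c, d) = table
    n = a + b + c + d
    rows, cols = (a + b, c + d), (a + c, b + d)
    expected_min = min(r * k / n for r in rows for k in cols)
    diff = abs(a * d - b * c)
    if expected_min < 5:
        diff = max(diff - n / 2, 0.0)
    chi2 = n * diff ** 2 / (rows[0] * rows[1] * cols[0] * cols[1])
    p = math.erfc(math.sqrt(chi2 / 2))  # upper tail, 1 degree of freedom
    return chi2, p

def retain(table, alpha=0.20):
    """Keep an item whose probability value falls below alpha."""
    return item_chi_square(table)[1] < alpha

# Hypothetical counts: [terminators true, false], [remainers true, false].
item_1 = [[22, 12], [12, 18]]
item_2 = [[17, 17], [15, 15]]
print(retain(item_1), retain(item_2))
```

With these invented counts the first item would be retained and the second dropped.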
TABLE 1. MEANS AND t VALUES OF FOUR COMBINED MEASURES ON THE TWO GROUPS OF 
TERMINATORS AND REMAINERS 

Subsample A Subsample B 
Measure Term. Remain. Term. Remain. 

2.54 1.07 
2.92 3.44 
.73 11.03 . Of ; ‘ 
30.20 34.90 2. dé 29.64 33.14 

* Significant beyond the 10% level 
** Significant beyond the 5% level 
*** Significant beyond the 1% level 


TABLE 2. CROSS-VALIDATION OF FOUR COMBINED MEASURES ON THE TWO GROUPS OF 
TERMINATORS AND REMAINERS 

B Means from A Measures A Means from B Measures 
Measure Term. Remain. Term. Remain. 

2.58 2.20 ‘ | 2.30 1.47 
3.04 3.39 2.62 2.98 
6.03 11.48 if 1.44 
29.64 33.14 30.2 34.90 

* Significant beyond the 10% level 
** Significant beyond the 5% level 
*** Significant beyond the 1% level 





More pertinent to the use of these indices as predictors is the percentage of 
cases correctly identified as either Terminator or Remainer for each index. Table 3 
gives these percentages and the percentages identified by cross-validating the meas- 
ures derived from each sample against the other. 

TABLE 3. PERCENTAGE OF CASES IDENTIFIED BY COMBINED MEASURES 

              Original Sample    Cross-validating Sample    Total Sample 
Measure          A       B            A        B 

PI-A            68      67           52       62                 66 
PI-B            68      60           60       65                 63 
SR-D            65      68           62       62                 65 
VEO             62      62           62       58                 63 





The results of the two independent analyses were then compared to provide 
items for a final combined set of indices. These final combined indices were composed 
of items in the original two subsamples the product of whose two independent 
probability values were beyond the .05 level. The final combined indices for the four 
measures were then applied to the entire group of terminators and remainers. Table 
3 also shows the percentages discriminated by these total combined indices. 
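
Read literally, the selection rule for the final indices multiplies the two probability values obtained for an item in subsamples A and B and keeps the item when the product is below .05. A minimal Python sketch of that reading follows; the item names and probability values are invented for illustration, and the function name is an assumption.

```python
def passes_combined_rule(p_a, p_b, threshold=0.05):
    """Keep an item when the product of its two independent probability
    values (one from each subsample) falls below the threshold."""
    return p_a * p_b < threshold

# Hypothetical probability values for two items: (subsample A, subsample B).
items = {"item_1": (0.15, 0.18), "item_2": (0.19, 0.30)}
kept = [name for name, (p_a, p_b) in items.items()
        if passes_combined_rule(p_a, p_b)]
print(kept)
```

Here the first item has a product of about .027 and is kept, while the second has a product of about .057 and is dropped.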


DISCUSSION 


Our results seem to indicate that discrimination between potential terminators 
and remainers in psychotherapy is possible with the use of a few relatively short 
and objective testing instruments. However, at this stage of research these measures 
seem more interesting in terms of their psychological meaningfulness than in their 
sheer predictive ability. Our results do seem to confirm earlier studies as to the gen- 
eral difference between terminator and remainer. Thus, remainers tend to be more 
intelligent, better educated and have higher-level jobs than terminators. At the 
same time, the remainers are rated somewhat sicker according to VA per cent of 
disability than terminators, are more dissatisfied with themselves, feel worse, and 
see themselves as having poorer interpersonal and overall adjustment than terminators. 
It is important here to keep in mind the possible difference between how the 
patient views himself and the objective facts. For example, there is no real conflict 
between the fact that the remainers hold higher socio-economic status and their own 
feelings of personal dissatisfaction. 

If we consider premature termination of treatment as a meaningful and direct 
response to the therapeutic situation as such, by inference we get some fairly clear 
indications of what sort of person can accept and remain in psychotherapy on a 
voluntary, outpatient basis in a VA Mental Hygiene Clinic. It should be pointed out, 
however, that identifying remainers in treatment is not necessarily synonymous with 
identifying improvement in treatment. 

The presence of a minimal level of active and goal-directed persistence towards 
culturally approved goals seems crucial for continuance in treatment. This is part of 
the general pattern in which remainers show themselves to be more “successful” 
than terminators in cultural and social activities. The remainers have a higher socio- 
economic level, more education, and they report fewer job changes. On the other 
hand, the terminators as a whole report more aggressive acting out and trouble with 
the law, more hostility toward authority, more restlessness and nomadism. 

In terms of remainers being more “successful”, it may not be quite as trivial a 
point as it sounds to suggest that remainers remain because they are more prone to 
finish something once they start. Of the remainers, 56% graduated from elementary 
school, high school, or college. Only 34% of the terminators completed one of these 
three levels of schooling. Even taking into account the fact that remainers tend to 
have somewhat more education the significance of the difference between the two 
proportions of “graduates” is beyond the .01 level. 

Perhaps an equally important characteristic of the remainers seems to be the 
implication that they are likely to be neither as rigid in some of their personal atti- 
tudes nor as lacking in impulse control as the terminators. In terms of the Personal 
Inventory (B), or attitude scale, the remainers show less so-called conventionalism, 
stereotypy and less of an uncompromising insistence that “right” and “wrong” can 
be sharply distinguished. It is interesting to note that while remainers as a group are 
apparently more willing and/or able to accept and live by cultural norms they are 
less prone to voice acceptance of rigid conventionalism and are less extremely opin- 
ionated. 

In terms of the Personal Inventory (A) the terminators show themselves gen- 
erally more impulsive, more restless, and nomadic than the remainers. In addition, 
there seems to be an important and distinct relationship in both groups between the 
degree of self-dissatisfaction, as shown in the Self-rating discrepancy score and the 
restlessness and lack of impulse control reflected in the Personal Inventory (A). 

Configural scoring also reveals certain striking differences between the two 
groups. Each of the four indices was dichotomized at the median and the resulting 
patterns were examined. In thus contrasting the two groups it is evident that a 
markedly higher proportion of terminators (37%) than remainers (13%) have a 
combination of relatively low self-dissatisfaction plus much restlessness. An even 
greater and opposite difference in proportions of remainers (35%) and terminators 
(3%) show a combination of high self-dissatisfaction plus little restlessness. If we 
make a free interpretation here, this finding suggests that a potential remainer shows 
motivation for treatment by expressed self-dissatisfaction and implements his 
“stay-ability” by high frustration tolerance for the long-term aspects of psychotherapy. 
A highly likely terminator shows the reverse of this combination. 
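
Configural scoring of this kind can be illustrated with a short Python sketch. The records and score values are invented, the field names are hypothetical, and scores equal to the median are counted as "low", a tie rule the article does not specify.

```python
from collections import Counter
from statistics import median

def configural_patterns(records, index_a, index_b):
    """Dichotomize two indices at their medians (computed over all records)
    and return, per group, the proportion of cases in each pattern."""
    med_a = median(r[index_a] for r in records)
    med_b = median(r[index_b] for r in records)
    counts, totals = Counter(), Counter()
    for r in records:
        pattern = ("high" if r[index_a] > med_a else "low",
                   "high" if r[index_b] > med_b else "low")
        counts[(r["group"], pattern)] += 1
        totals[r["group"]] += 1
    return {key: n / totals[key[0]] for key, n in counts.items()}

# Hypothetical scores on self-dissatisfaction and restlessness.
records = [
    {"group": "Terminator", "dissatisfaction": 3, "restlessness": 9},
    {"group": "Terminator", "dissatisfaction": 4, "restlessness": 8},
    {"group": "Terminator", "dissatisfaction": 9, "restlessness": 7},
    {"group": "Remainer", "dissatisfaction": 10, "restlessness": 2},
    {"group": "Remainer", "dissatisfaction": 11, "restlessness": 3},
    {"group": "Remainer", "dissatisfaction": 5, "restlessness": 6},
]
props = configural_patterns(records, "dissatisfaction", "restlessness")
print(props[("Terminator", ("low", "high"))])
print(props[("Remainer", ("high", "low"))])
```

In this invented sample two of three terminators show low self-dissatisfaction with high restlessness, and two of three remainers show the reverse pattern, mirroring the direction of the differences reported in the text.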

While the above combination is one of the most clear-cut, other combinations 
of pairs of scores show similarly interesting and meaningful trends: Terminators are 
more likely to be both restless and also rather uncompromising in their attitudes as 
shown in Personal Inventory (B) (44% of Terminators and 7% of Remainers). Re- 
mainers show the reverse combination (47% of Remainers, 12% of Terminators). 
Remainers are much more likely to be both self-dissatisfied and yet of higher education, 
intelligence and socio-economic level (45% of Remainers, 9% of Terminators). 
SUMMARY 


Test results, involving four measures, on a sample of 128 terminators and re- 
mainers in outpatient psychotherapy were analyzed in an attempt to differentiate 
these two groups. A double cross-validation procedure was used and significant differ- 
ences between the two groups were found. Remainers tend to be less nomadic, less 
impulsive, less rigid in personal attitudes and more self-dissatisfied than terminators. 
In addition, remainers have more education and a higher socio-economic level than 
terminators. Various configural combinations of these measures discriminate sig- 
nificantly between the two groups. 
PROBLEM 


It has been suggested by Klebanoff“) that persisting interference with brain 
function, as from the presence of pathologic tissue, may have more pronounced 
effects upon overt behavior than does the experimental excision of healthy tissue. If 
this hypothesis is true, it would explain why such projects as the Columbia Grey- 
stone project ©? failed to demonstrate that many widely used psychological tests pro- 
vide valid indices of brain damage. This study re-opens the question of validity, in 
the light of two comparisons: patients in whom pathologic tissue was removed were 
compared as to psychologic functioning with those in whom it remained; and various 
physical indices of brain damage were correlated with the psychologic findings. 


1The author wishes to acknowledge his great indebtedness to Alfred K. Baur, Henry Schwartz, 
and Howard R. White, whose encouragement and help made this report possible. 
METHOD 

Five years after receiving a head injury sufficiently severe to produce habitual 
seizures, 31 male veteran patients were examined with a variety of psychologic tests. 
The findings are reported in order to answer the following questions: (1) What effect 
(upon psychological functions) had the surgical removal of scar tissue in 13 of these 
patients, as compared with those in whom no surgery was done and as compared 
with their own pre-traumatic status? (2) Which psychologic functions reflect (at 5 
years, spontaneous improvement is considered to have ended) the presumably per- 
manent effects of the original trauma? The 31 patients represented all patients with 
post traumatic epilepsy entering the hospital during the period of the study who had 
sustained a head injury five years or more previously. The median age was 23, and 
their ages varied from 20 to 53 at the time of the original head injury. Median educa- 
tion was 11 grades, with a range from 4 grades to 4 years of college. The median 
verbal intelligence level was 100, with a range from 76 to 137. All the patients de- 
veloped seizures as a result of the injury. The median post traumatic unconscious- 
ness, an index to the severity of the blow, was one week, with one patient being un- 
conscious for several weeks. 


Psychological Measures. The following tests were used to measure psychological 
functioning: Wechsler Bellevue verbal and performance intelligence, and immediate 
memory; Shipley-Hartford conceptual test; color-form sorting task; Porteus Mazes; 
Bender Gestalt; and Rorschach. The color-form sorting task, in which the patient is 
asked to find a way of sorting varicolored geometric figures, is scored “correct” or 
“incorrect”; it is scored “correct” if the patient sorts first for geometric form, then 
for color; it is scored “incorrect” if the patient offers any other solution. The Bender 
drawings were assigned a numerical rating, one to five, by clinicians who used as a 
criterion the definiteness and severity of the cerebral pathology indicated by the 
drawings. Bender’s monograph“? and Hutt’s work“? contain sufficient explanation 
of the interpretation of Bender drawings. Each measure was used as a variable. In 
addition, a combined score of all 9 tests was calculated to indicate the general level 
of psychological functioning of the patient. Finally, a “neuropsychologic status” 
score was calculated by combining the scores of each patient on the five measures 
regarded as most sensitive to brain disease: Bender Gestalt, Shipley-Hartford, Porteus 
Maze, Color Form Sorting, and Rorschach (R). Pretraumatic psychologic 
status was measured indirectly by accepting years of education as an index. 


Measures of Organic Damage and Status. The first hypothesis calls for comparing 
patients who had had scar tissue removed with those who show similar post trau- 
matic sequelae but who did not have scar tissue removed. The criteria for executing 
this operation were: (1) the patient’s seizures were not medically controllable; (2) 
an epileptogenic focus had been found; and (3) the focus was in an operable location. 
The non-operated group is thus assumed to retain its pathologic tissue, since a 
seizure-proneness (though medically controllable) existed. The second hypothesis 
called for comparing the psychologic measures with several parameters of brain 
damage: duration of post traumatic unconsciousness; locus of damage; diffuseness of 
damage. Locus of damage was defined in two ways: the lobe in which the injury was 
located, and the hemisphere. Diffuseness of damage was defined in terms of whether 
there was a focal or a general EEG pattern. 


RESULTS 

Effect of Removal of Scar Tissue upon Psychological Functions. There is a tend- 
ency for the surgically treated patients to exceed the medically treated group in all 
the psychologic functions measured. This tendency is not quite statistically signifi- 
cant, and perhaps could be attributed to original sampling differences between the 
groups. A second finding casts doubt upon the sampling explanation: when each 
patient’s neuropsychologic status is correlated with education (the best pre-traumatic 
estimate of intellectual functioning), a significant Pearsonian r is found for the surgi- 
cal group, but not for the medical group, as shown in Table 1. This finding, which is 
also duplicated when overall psychologic status is correlated with years of formal
education, indicates that some factor has intervened to upset the normally moderate 
consistency between intellectual status at two points in the life history in the case 
of the medical group, but not in the case of the surgical group. One infers that this 
factor is the uncorrected sequelae of the brain trauma. 


Relationships between Psychological Functions and Brain Status. Several attributes
of brain status were employed, and varying results were obtained depending
on the parameter used. Duration of post traumatic unconsciousness, often considered
to reflect the severity of a head injury, was inversely related to such cognitive
measures as verbal intelligence and immediate memory. These relations hold at the
.01 level, even when the effects of education are partialed out. The highest
correlation of this index of severity was with the discrepancy between intelligence and
memory. However, this is not to say that memory lagged behind intelligence in the more
“damaged” patients, as is conventionally supposed. In half the cases, intelligence
lagged behind memory. The location of damage, in the anterior-posterior dimension,
was found to be related to intelligence, memory, and also to Rorschach productivity.
In its right-left dimension, location of damage was found to be related to
combinations of the psychological measures. That is, overall psychologic status and
neuropsychologic status were more greatly impaired by damage in the right hemisphere.
The pattern of psychological findings with regard to degree of focalization (as
determined by the EEG) was the same as that of the “sidedness”: overall and
neuropsychologic status were impaired in the cases with non-focal EEG patterns.


                                             Physical Variables

                            Duration of        Locus of         Diffuse         Damage on
Psychologic                 Post-traumatic     Damage toward    Pattern on      Right Side
Variables                   Unconsciousness    Frontal Lobe     E.E. Gram¹      of Brain²
                            r (N = 15)         r (N = 15)       χ² (N = 28)     χ² (N = 19)

Overall                     [illegible]        [illegible]      [illegible]     [illegible]
Neuropsychologic            [illegible]        [illegible]      [illegible]     [illegible]
Rorschach R                 [illegible]        [illegible]      [illegible]     (n. s.)
Verbal Intelligence         [illegible]        [illegible]      [illegible]     (n. s.)
Immediate Memory            [illegible]        [illegible]      [illegible]     (n. s.)
Discrepancy between
  Intelligence and Memory   [illegible]        [illegible]      [illegible]     (n. s.)
Education (estimate of
  Pre-traumatic function)   [illegible]        [illegible]      [illegible]     (n. s.)

*Significant at .05 level
**Significant at .01 level
¹Differences in favor of patients with focal electroencephalograms
²Differences in favor of patients with damage in left hemisphere
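
The remark above that the relations hold "even when the effects of education are partialed out" refers to a partial correlation. As a minimal sketch only (not the authors' own computation), a first-order partial correlation can be obtained from three zero-order correlations. The value -.57 for verbal intelligence against duration of unconsciousness is the one reported in the Conclusions of the paper; the two correlations involving education are hypothetical placeholders, since the paper does not report them.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    denom = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denom == 0.0:
        raise ValueError("undefined when x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denom


# x = duration of unconsciousness, y = verbal intelligence, z = years of education
r_xy = -0.57  # reported in the paper's Conclusions
r_xz = -0.10  # HYPOTHETICAL: duration vs. education (not reported in the paper)
r_yz = 0.40   # HYPOTHETICAL: intelligence vs. education (not reported in the paper)

r_xy_given_z = partial_corr(r_xy, r_xz, r_yz)
print(f"partial r (education held constant): {r_xy_given_z:.3f}")
```

With placeholder values of this size the partial correlation stays close to the zero-order one, which is the pattern the text describes; real values would have to come from the original data.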


CONCLUSIONS 

The problem examined in this paper is the long range psychologic effect of head 
injury under two conditions involving the type of treatment given and the original 
severity of trauma. Thirty-one male patients who had developed seizures after a head
injury were examined five years post traumatically. Three types of analysis were
performed: comparisons of patients whose cortical scar tissue (theoretically present in all
cases of post traumatic epilepsy) was removed with patients whose scar tissue was 
not removed; comparison of each patient’s post traumatic status with an estimate of 
his pre-traumatic intellectual status (years of education); and comparison of the 
psychological measures, separately and in combination, with various measures of 
brain status. The first two comparisons are relevant to the question of whether the 
continued interference with normal brain function of pathologic tissue has more
adverse psychologic effects than when that tissue has been excised. The findings 
are consistent with the hypothesis that it does. The surgically treated patients have 
higher total scores (all 9 psychological test scores combined additively) and higher 
‘‘neuropsychologic status’”’ scores (5 psychological tests considered sensitive to brain 
damage); the total scores and neuropsychologic scores of surgically treated patients 
have higher correlations with their pre-traumatic status than have the same scores of 
medically treated patients. The statistical substantiation of these hypotheses is 
suggestive but not conclusive. Regarding the effects of original severity of injury, 
the findings are that intelligence, memory, and the discrepancy between them (in 
either direction) are related to the duration of post traumatic unconsciousness. These 
correlations are —.57, —.61, and +.80 respectively. The overall scores, in contrast, are 
not as closely related to this measure of severity of damage. The other measure of 
severity (the electroencephalographic findings of a focal vs. diffuse pattern) is re- 
lated at the .01 level to the overall psychologic status and neuropsychologic status 
scores, but not to the separate measures. The localization evidence obtained with 
reference to hemisphere and lobe damaged is more difficult to interpret® 4 ©. In 
general, these findings strengthen the views of the equipotential theory criticism of 
experimental, specific-excision studies. If brain tissue is more conspicuous in its
effects on psychologic functioning when the tissue is pathologic than when it is
altogether absent, then limited excision of healthy tissue will not reveal much evidence
regarding the validity of psychologic tests. 


SOME OBSERVATIONS ON PATIENT GOVERNMENT*


PROBLEM

One of the difficulties in treating the individual in a psychiatric institution is 
that each individual must give up certain private features of his behavior which set 
him apart from other people, and become part of a large group in which much of his 
individual identity is lost. By necessity, each psychiatric patient foregoes some of 
his individualized problem-solving methods and allows the institution to regulate a 
large part of his life. This is especially true of the chronically hospitalized patient 
who, throughout years of institutional life, must adopt the group patterns of
behavior if he is to become a “good” patient. The outcome of this process was aptly
expressed by a recovered chronic patient who said: “I find it hard to think for myself,
the hospital has done so much of my thinking for me these last few years.”

In a small way, patient government, as practiced in the Continued Treatment 
Service of this hospital, seems to give the chronic patient an opportunity to again 
become an active participant in attempting to resolve his individual needs with the 
demands of his environment. The patients are exposed to a situation in which they 
recognize and discuss problems of living together, and attempt to apply their own 
solutions to them. They are able to see the results of their attempts to solve their 
problems, and regain some of their former problem-solving skills in an institutional 
setting. 

This paper presents the structure of patient government as it has functioned in
the Continued Treatment Service of this hospital, some of the problems patient
government presents to an institution, some of the parameters of a successful patient
government operation, and an evaluation of the effects of patient government on
those patients who take part in it.


*This paper is an outgrowth of the observations of the writer, who followed the
development of the Patient Government for two years, attending each committee meeting.
Dr. Roy E. McDonald and Dr. Wallace Dalley actually organized and supervised Patient
Government activities during these two years.


STRUCTURE

In 1953, the first patient government group was organized on the Continued Treatment
Service. The patients chose: (1) to hold weekly meetings, without the presence of any
staff members except a nurse who took notes; (2) to elect, each month, on their own,
a chairman, vice-chairman, and secretary; (3) to present immediately to the ward
physician, by means of this three-man committee, some of their problems of living
together on a ward and some suggested solutions to these problems; and (4) to have the
committee return to the patients with a report of its meeting with the physician.
During the course of the two years, steps 2, 3, and 4 were altered, with some unhappy
results, which are presented, in part, in the section on the parameters of a successful
patient government.


PROBLEMS 


Staff. The aides immediately became somewhat defensive, wondering out loud,
“Who is running this ward, the staff or the patients?” Some of the first problems
presented to the physician by the patient committee centered around the “foolish”
routines they must follow because of rules and regulations enforced by the aides.
The patients were successful in changing some of these “foolish” routines. The
physician dealt with the complaints realistically, going along with valid complaints
and setting limits when unrealistic demands were made. This led to the second
phase of patient-staff relationship.

The patients, after receiving some concessions, began to directly criticize some 
of the aides. This seemed to bring out into the open the real problem, namely: Who 
will the physician remain loyal to, the aides or the patients? Once this problem came 
into the open, the physician was able to clearly define his loyalty to both groups by 
stating that he would listen to any realistic complaint; but, except for an extreme 
case, would expect the patients and the aides to work out their problems of living 
together. 


One other bit of “testing” which was evident in the first meeting was revealed in
the type of demands the patients first made: (1) “Can we have breakfast in bed on
Sunday morning?” and (2) “Can we sleep late Sunday morning?” The physician
recognized that the patients were really asking, “How far are you going to let us go
in these demands?”, and responded by setting realistic limits.


Patients. The problem of the individual having relinquished many of his own 
problem-solving methods and conforming to the institutional patterns was clearly 
demonstrated in the first few meetings. The physician encouraged the patients to 
consider problems such as patient transfers to other wards, what should be considered 
in allowing a patient to go on pass, what they could do to work toward getting an 
“open ward’’, and even to consider what role the hospital was playing in getting 
them well. When the patients were asked to move away from the comfortable areas 
of “gripes” and into areas where they must begin to consider the structure of the in- 
stitution and their part in it, their fear of entering into this area was loudly expressed. 
One of the most common reactions of the patients to these demands that they
become active in considering their part in this was: “We can’t even think of that, that’s
the doctor’s business.” Except for one or two “brave souls,” they were content to
follow the pattern of permitting the institution to do much of their thinking for them.


SOME PARAMETERS OF A SUCCESSFUL PATIENT GOVERNMENT OPERATION


The four steps outlined in the paragraph under Structure were temporarily altered
in the operation of the patient government within the two-year period. Needless to
say, if any person or group is successful in remaining in office more than one month
(step 2), the interest and involvement of the other patients begin to lag. This was
attempted on two occasions: once the patients successfully dealt with it; once (on
another ward) they passively accepted it but rebelled by not taking part in the
patient government program. This was one of the factors leading to the termination
of the patient government on this particular ward.¹ If the patients are not able to
present their problems and solutions to the ward physician within a short time
(step 3), they begin to lose interest in patient government, since their problems are
not promptly considered. This has also happened, with the result that no meetings
were held for a two-month period. Active interest on the part of the ward physician
would seem necessary in the functioning of a successful patient government.

The violation of step 4 has led less obviously to difficulties but seems to have 
more serious effects. If the physician becomes too active in “taking over” the patient
government, by meeting with the patients (step 1), and/or returning to the patients
himself and telling them of his decision (step 4), he usurps the prestige of the
three-man council. This has led, in the past, to a refusal of patients to serve on the
council which, in turn, results in low morale and an ineffective patient government.

An example of this illustrates why this observer feels particular attention should 
be paid to the role of the physician in patient government. After a year of successful 
patient government operation, a new physician took charge of the ward. His policy 
was to meet with the council, and to personally return to the large group and tell 
them of his decisions, thereby bypassing the council. This was done because of 
certain overall therapeutic reasons which are not evaluated here. During the first 
year of operation, not one patient who was voted into office refused to actively
serve in that office. However, when a position on the council lost much of its prestige
and importance under the new physician’s administration, it became very difficult to
persuade anyone to accept office. The patients seemed to be saying, “Okeh, it’s your
show, doctor, you run it.” This suggests a second and third parameter of successful
patient government: it seems necessary that the patients feel this is their organization,
and that the office of representative be an important one. This observer feels that these
are probably most important in promoting a successful patient government.


RESULTS 


General Impressions. Such results as increased morale of the patients have been
reported elsewhere (1, 2), and have also been one of our major impressions. The
increased social interaction among patients has been marked. Prior to the initiation of
patient government, it was a rare occasion to see two or more patients talking among
themselves on the ward. They preferred to go about their own business, “interfering”
as little as possible with one another. Now it is a rare event not to find at least
some interaction taking place among the patients when they are on the ward.


Hospital Adjustment Changes. Every three months each patient on the ward is
rated by the aide on the Hospital Adjustment Scale. This scale describes the
degree of success the patient has reached in adjusting to his environment. The
uncooperative, asocial patient who interacts with neither the patients nor the staff
typically receives a low score on the scale. The patient who can care for himself and
interact with those about him typically receives a high score on the scale. While it
is conceivable that a very dependent person may go along with the demands of the
environment and yet be a very “sick” person, it has been our experience that a high
score generally separates those patients who are well integrated from those who are
confused and disorganized. We thus have a measure of “adjustment” before and
after a patient serves as a member of patient government.

Table 1 lists the patients who served as patient government representatives for
the first six months (three patients per month, total of 18 patients). At this point,
we also have additional information covering an 18-month period since these patients
took office as representatives. Column 1, Pre-patient Government Hospital
Adjustment Scale Score, refers to the percentile score of each patient prior to his
entering patient government office. Column 2, Post-patient Government HAS score or
Trial Visit, refers to either the HAS percentile score immediately following the
patient’s term in office, or information as to whether the patient left the hospital on
a Trial Visit. Column 3 summarizes that period from six months after the patient
served as a patient government representative to the present time.


TABLE 1. COMPARATIVE STATUS OF PATIENTS BEFORE AND AFTER PATIENT
GOVERNMENT ASSIGNMENTS AS MEASURED BY HAS SCORES OR TRIAL VISIT
STATUS.

                      Pre-Pt Gov't    Post-Pt Gov't HAS Score
No.   Patient         HAS Score       or Trial Visit             Current Status

 1    A. C.           99              Trial Visit                Trial Visit
 2    J. B.           88              99                         96 Average HAS
 3    R. B.           88              99                         Trial Visit
 4    S. J.           57              99                         Trial Visit
 5    C. D.           99              99                         Trial Visit
 6    O. H.           73              Trial Visit                Trial Visit
 7    C. S.           97              Trial Visit                Trial Visit
 8    A. D.           54              71                         76 Average HAS
 9    G. N.           97              Trial Visit                Trial Visit
10    A. T.           41              47                         Trial Visit
11    L. P.           58              99                         99
12    E. B.           83              Trial Visit                Trial Visit, but returned
13    J. B.           94              68                         Trial Visit
14    [illegible]     57              79                         Trial Visit, but returned
15    J. R.           85              Trial Visit                Trial Visit
16    L. D.           76              83                         85
17    H. K.           99              Trial Visit                Trial Visit
18    B. C.           96              99                         Trial Visit


¹This is the antithesis of behavior in normal groups, where successful officers are
re-elected to office many times. The unwillingness of a patient group to allow this to
happen suggests both high morale and perhaps childlike enthusiasm for “my turn.”

In comparing Column 1 with Column 2, it is to be noted that for most patients²
a definite change took place in their behavior while in office. With one exception,
patient #13, the post-office adjustment of each patient increased (patient #5 could
go no higher in his HAS score). That is, almost all of the patients improved in their
adjustment during and just after their term in office.³ Seven out of the 18 patients
left the hospital before post-office HAS scores could be obtained. Thus, the HAS
scores generally show that patients who engage in patient government office show
improvement shortly after their term in office (Column 2). With one exception,
these patients have not regressed to their “before” level of adjustment. No similar
indications of HAS improvement are found for those patients who were not elected
to office during this six-month period (footnote 3).
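
The counts quoted in this paragraph and in footnotes 2 and 3 can be re-derived from Table 1. The following is a minimal sketch of that tally, not part of the original study. It assumes the scores are transcribed correctly from Table 1, and in particular that the garbled row for patient 14 reads pre = 57, post = 79; that reading is an assumption, chosen because it agrees with the "9 improved, 1 regressed" count in footnote 2. The figures for non-elected patients are the summary counts from footnote 3, since no per-patient data are given for them.

```python
from collections import Counter

# (pre-office, post-office) HAS percentile scores, in Table 1 row order.
# None = patient left on Trial Visit before a post-office rating was made.
# ASSUMPTION: the garbled row for patient 14 is read as pre = 57, post = 79.
REPRESENTATIVES = [
    (99, None), (88, 99), (88, 99), (57, 99), (99, 99), (73, None),
    (97, None), (54, 71), (97, None), (41, 47), (58, 99), (83, None),
    (94, 68), (57, 79), (85, None), (76, 83), (99, None), (96, 99),
]


def tally(pairs):
    """Count improved, regressed, unchanged, and unrated patients."""
    counts = Counter()
    for pre, post in pairs:
        if post is None:
            counts["left before rating"] += 1
        elif post > pre:
            counts["improved"] += 1
        elif post < pre:
            counts["regressed"] += 1
        else:
            counts["unchanged"] += 1
    return counts


def share_improved(counts):
    """Proportion improved among patients who received a second rating."""
    rated = counts["improved"] + counts["regressed"] + counts["unchanged"]
    return counts["improved"] / rated


rep_counts = tally(REPRESENTATIVES)

# Summary counts for the 16 non-elected ward patients, taken from footnote 3.
non_elected = {"improved": 3, "regressed": 6, "unchanged": 3, "left before rating": 4}

print(dict(rep_counts))
print(f"representatives improved: {share_improved(rep_counts):.0%}")
print(f"non-elected improved:     {share_improved(non_elected):.0%}")
```

The tally only restates the descriptive comparison made in the text; it does not address the lack of a comparable control group raised in the Discussion.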


DISCUSSION

A strong word of caution should be inserted here. At no time were we able to
use comparable control groups in our evaluation of the changes in behavior of
patient government representatives. Indeed, the “mean” pre-office HAS percentile
score of the 18 patients who were elected to office was 87, while that of the 16
patients who were on the ward but not elected as representatives during this
six-month period was 80. Thus, those patients who were chosen as
representatives tended to be more verbal and took more responsibility for themselves
than those who were not chosen. This suggests that the patients were aware of the
demands of the office and tended to select those best equipped to handle the patient
government.


²Of those patients who were rated on the HAS following their terms in office, 9 improved while
1 regressed.

³For the 16 patients who were on the ward and who were not elected as representatives during
this six-month period: 3 showed HAS score increases, 6 showed HAS score decreases, 3 remained the
same, and 4 left the hospital before HAS scale scores could be obtained.

One source of possible spuriously high HAS score increases for the 18
“representatives” lies with the aide-rater. He may have been so enthusiastic about patient
government that he looked for and ‘found’ improvement in the patients who 
served on the patient council. This was controlled somewhat by the fact that he 
rated every patient ‘routinely’ every three months; and, since he at no time at- 
tended any patient government meetings (which were held at a time other than 
when he was on duty), he seldom was aware of who the current patient government 
representatives were. This, however, still remains one possible source of error in 
evaluating patients by their HAS scores. 

In summary, it is our impression that experience as a patient representative 
helps the participating patient to move toward a higher level of adjustment in com- 
parison with those patients who were not elected to office. The objective measure- 
ments used (HAS) support these impressions, although the criteria are not to be 
regarded as proof as to the effectiveness of patient government, but more as sup- 
portive systematic observations. 


CONCLUSIONS 


(1) There seem to be certain parameters which lead to successful patient govern- 
ment operation. In order of importance: (a) The patients must feel that this is their 
organization; (b) The position of patient government representatives should be 
looked upon as an important office and the prestige of this office should not be 
usurped by the physician “taking over” some of the responsibilities of this group; (c)
The patient government representatives must receive active interest on the part of 
the ward physician in his willingness to promptly give his attention to their problems. 

(2) The experience of a patient as a patient government representative almost 
always results in a better level of adjustment following office as suggested by a 
change in the patient’s HAS scores before and after office. This positive change in 
level of adjustment usually did not take place for those who were on the ward but not 
elected as representatives during this six-month period. 

(3) The main purpose of patient government seems to be to help the patient 
become an active part of the institutional environment in which he finds himself 
rather than a dependent “cog” in the machinery of the institution.


AN EXPLORATION IN THE USE OF PERSONAL ADJUSTMENT MOTION
PICTURES AS A PSYCHOTHERAPEUTIC MEDIUM

University of Texas The Brown Schools, Austin, Texas
AND 
PAUL WHITE, M. D. 


University of Texas and The Brown Schools 


INTRODUCTION 


The use of motion pictures as a medium of communication has spread into many 
fields of human activity. The visual instruction bureaus of major universities* and 
the film rental libraries throughout the country can supply 16 mm. films on a great 
variety of topics. An attempt to alter attitudes by means of motion pictures is
represented by such films as “Home of the Brave,” “Gentleman’s Agreement,” “Pinky,”
and “Lost Boundaries.” The success of these attempts has been equivocal, but they
mark a trend in the use of the mass media of communication for purposes of dealing
with social problems. Many behavior deviations on the part of children, especially 
juvenile delinquency, have been attributed to the unhealthy influence of certain 
types of motion pictures. In recent years films concerned with problems of personal 
adjustment have been increasingly employed as teaching devices at the senior high 
school, college and adult level. A variety of such films are presently available and 
serve the purpose of stimulating discussion, presenting symptomatology and demon- 
strating some of the vagaries of human adjustment and the depths of human emo- 
tionality. Despite wide use of such films on the teaching and adult education level, 
little systematic use has been made of these films with populations who are already 
exhibiting various signs of maladjustment. The purpose of the study herein reported
was to explore the therapeutic possibilities of some selected personal adjustment 
films in what might be described as a group therapy situation. 


PROCEDURE 
Subjects. Subjects were adolescent males at the Oaks Unit of the Brown Schools, a
residential school for exceptional children located in Austin, Texas. The students
were selected on the basis of communality of background factors rather than by 
diagnostic categories. Common to the group were the following: (a) a history of 
school failure and truancy, (b) unhappy or broken home life marked by such factors 
as parental desertion, adoption and subsequent rejection or marital discord, and (c) 
poor interpersonal relationships resulting from a poverty of social skills usually 
evidenced by their failure in making, holding, and adding new friends. In varying 
degrees the above factors have resulted in their not being able to function adequately 
in outside environments and eventuated in their placement in the school either by 
their parents or by the agencies who were in charge of them. 

Nine students, ranging in age from 13 to 16 years, constituted the group used 
in this study. They ranged in IQ from 85 to 140. Some 12 films were shown with six 
films repeated on request of the group. Prior to the beginning of the sessions, each 
student was interviewed individually and told that he had the opportunity of at- 
tending sessions during which moving pictures concerned with “problems that people 
like himself have’’ would be shown. No student turned down this opportunity to 
attend. Meetings were held once a week for two to three hours, and tape recordings 
were made of virtually all the sessions. Analyses of these recordings and notes were
made after each session by the authors.


*The cooperation of the Film Library, Visual Instruction Bureau, University of Texas, is
gratefully acknowledged.

Selection of Films. Rough screening of films was made on the basis of brief
descriptions of films given in the various catalogues. It was found advisable to preview each
film in order for the therapist to have a better understanding of it, and to weed out 
those films whose actual portrayal did not live up to the expectancies gained by 
reading the catalogue description. Despite the desirability of having a rationale for 
the sequence of films, this proved to be impossible and generally the availability of 
the film proved to be the chief factor in determining its place in order of presentation. 
The names of the films are listed in the order used. An asterisk denotes a film re- 
peated in later sessions by request of the group. A 16 mm. Bell and Howell Sound 
Projector was used. 

*1. Age of Turmoil                            7. Farewell to Childhood
 2. Social Sex Attitudes in Adolescence       8. Angry Boy
 3. Feeling Left Out                         *9. Feeling of Rejection
 4. The Gang                                 10. Feeling of Hostility
 5. The Outsider                             11. This is Robert
*6. Fears of Children                        12. Over-Dependency


RESULTS 


After each film had been seen, a “bull session” was held with no definite time
for adjournment. The therapist began the session by asking, “What did you think of
the picture?” As had been hoped, the films served as convenient vehicles to stimulate
conversation about areas which previously had met with a great deal of resistance.
From the first, the reaction of the students was generally favorable, and their
participation increased from session to session. There were more frequent departures
from the actual content of the films and an expanding of the problems more closely
related to individual backgrounds. As members of the group began to know and
understand each other’s background, there were increasingly more comparisons made
between characters and situations portrayed in the films and those experienced in
the lives of the students. Comment was not withheld until the end of the picture.
Sometimes interruptions of “that’s just like my old man” or “look, that’s just what
happened to you” were emitted during the actual showing.

A few examples may serve to point out the emotional impact of some of the
scenes in the films. In “Social Sex Attitudes in Adolescence,” there is a scene wherein
an adolescent boy has crudely drawn one of his sex fantasies about his girl friend.
His mother discovers the lewd picture. This one scene served to initiate a full
evening’s discussion about the fears most of these students had concerning their parents’
discovery of obscene pictures, as well as cases where they had been caught and
punished. Included in the discussion were additional points about proper dating
behavior and the difficulty of getting parents to “trust” them. The film “The Feeling
of Rejection” seemed to have a very strong impact on most of the group. When it was
finished, one of the boys said, “That really touched me,” and he spoke of his mother
liking his three stepsisters better than him. He elaborated about his mother’s going
off and leaving him and about his grandmother’s taking charge of his upbringing.
Another boy spoke about the clothes-judging scene in the picture. He compared it
to his own situation where he could not make his own decision about what to buy
and how scared he was whenever he had to make any decision at all. Still another
student, a 13 year old adopted boy, cried and said that the picture had hit him pretty
hard. Tearfully he said: “Why didn’t you take a club and hit us on the head with it?
It would have hurt less than that ---- picture. When I saw what was happening it
suddenly brought back all the things that I tried to do but never succeeded. It’s
just like the picture. I guess you give up trying after awhile.” In “Fears of
Children,” a 14 year old adopted boy from an unstable home was profoundly moved
by a scene in which the father kicks at a pet well loved by the boy. He recounted
that while his mother was dying, he brought in a cat and offered it to her (it was his
only pet) and put it on her bed. His father, upon entering the room a little while
later, kicked at the cat and ordered them both out of the room. He said he could
now understand some of the hostility and passive resistance he had developed towards
his father. The similarity of the scene so impressed him that several months later,
when the boy and his father were together making plans for his return to the family,
he told his father about the incident and what it meant to him, and the insight that
he had gained from the scenes in the picture. In “Angry Boy” there was general
agreement about the reason why children steal and the recounting of episodes of
stealing in the students’ own lives. There was some argument about the role of the
mother in the scene and hostility to the grandmother for taking over many of the
functions of the mother. There was also hostility to the father for not playing with
the boy, and the phrase “you will work on arithmetic till you get better grades”
provoked discussion about the role of the father in helping the child at school. Some
students believed it to be a function of the father to help the child, while others held
that it was the father’s job to earn a living and that he did not need to help with
homework. The mother’s frequent headaches were also a topic of discussion.
Some of the students believed that their own mothers used this mechanism whenever
the situation at home became too tense. The picture “The Feeling of Hostility” did
not have the impact that had been hoped for. Only one of the students (one of the
brightest) could agree with the mechanism involved, namely overintellectualization.
The rest were envious of the good grades attained by the girl in the picture. The
picture “Farewell to Childhood” was so intimately related to the problems most of
the students had in obtaining psychological independence that it proved to be one
of the most popular. Typical comments were “that’s just what goes on at my house
every night.” The scene wherein the school counselor asks the mother of the
adolescent girl “When do you want her to use her 15-year-old brain, when she’s 18 or
24?” stimulated much post-film comment about the difficulty of having parents
trust them and letting them rely on their own judgment. Of all the films, it was the
only one directly suggested to be shown to parents. It was felt that if most of their
parents were to see the picture they might better understand some of the problems
facing their children.


Comment was not limited to the bull sessions alone. It was noted that the 
students began to change the contents of their letters home. For example, one of 
the students wrote: “Dear Mother and Dad: Yesterday we saw a picture in group 
therapy. If you took the girl out and substituted a boy, you had me. It showed how 
tough it is for a boy of my age to have his parents understand him, and how we got 
caught in a mess and never talked about things and always felt that we were against 
each other.” 


CONCLUSIONS 

It is felt that the medium of personal adjustment motion pictures offers a 
promising means of getting at some of the problems facing emotionally disturbed 
children. Although no attempt was made to compare the efficacy of this method 
to group therapy alone or individual therapy alone, sufficient improvement was 
noted in the behavior of the students to warrant placing further confidence in 
this method. There are certain limitations in the films that were used, and the 
information gained from this study could be incorporated in future studies with this 
methodology. For example, one distinct limitation is that most of the present films 
are much too “pat” in the solutions given to the problems that are faced. The 
language, as far as adolescents are concerned, should be geared to the more common 
parlance of adolescence rather than the sophisticated language now employed. With 
planning and scheduling sufficiently in advance, it would be possible to set up a more 
rational schedule of presentation of pictures, starting with those whose emotional 
impact is less severe and only gradually beginning to use those films which have 
strong feelings portrayed. In this manner the security of the group could gradually 
be built up and serve as a background for dealing with emotionally loaded areas. 

From the experience gained in this study, it is felt that personal adjustment 
motion pictures may have in them an element not usually included in other psycho- 
therapeutic situations and as such may serve to enrich any group psychotherapy 
situation. If we postulate that repression is one of the prime dynamic factors in 
maladjustment, then it may very well be that the presentation of certain traumatic 
situations may reach at repressed areas and allow for their release in therapeutic 
situations. It is suggested, therefore, that specialized films be constructed which 
would focus on certain specific problems which have been found to be common in the 
repressed reservoir of emotionally disturbed children. For example, being caught 
and punished for masturbation, rejection from the group, parental rejection, hostility, 
marital tensions, could be portrayed skillfully and, it is hoped, therapeutically. More 
relevant comment on scenes during the actual running of the films might in some 
cases be advisable. The employing of a device whereby the problem portrayed in the 
film is turned over to the group at the end would seem advisable in some cases. Ap- 
propriate topics and levels of discussion for different age groups will have to be 
worked out. 

The increasing emphasis on mental health in its broader aspects throws a great 
burden and challenge upon all who recognize the importance of early prevention and 
the alleviation of emotional conflicts. How this challenge will be met is related to 
the welfare of our very civilization. The psychotherapeutic resources at present 
available are certainly limited and will not increase greatly in proportion to the 
needs of the growing population. It is felt that further research with the use of spec- 
ialized films will eventually make possible a means whereby many pressing adjust- 
ment problems can be portrayed and discussed. It should be emphasized that the 
mere presentation of a film does not do away with the need for a skilled discussion 
leader. Most of the present films and even more so the specialized films yet to be 
constructed, raise anxieties which have to be alleviated by complete and open dis- 
cussion in a group setting. It is also suggested that these types of films can be used 
periodically to enrich an ongoing therapy program or a community mental health 
project. 
PROBLEM 


A recent study by Souelem provides a scale for studying attitudes toward mental 
hospitals. The scale was originally applied to patients, since Souelem 
felt that the results would give an indication of the patients’ attitudes and might 
distinguish various patient groups from one another. Her paper suggested that 
future studies might well include comparisons with normal samples since the items 
were phrased so that they could be used in either a patient or a non-patient situation. 
A review of the literature relevant to this subject failed to reveal studies comparing 
the attitudes of patients toward mental hospitals with those of other groups. The 
present study is an attempt to make progress in this direction. 

Souelem constructed a scale in which attitude is measured by studying the 
agreement of subjects’ responses with statements on a single continuum of favorable 
to unfavorable. The score for each subject is the scale value of the statements with 
which he agrees. In this study, an attempt is made to determine whether groups may 
be distinguished from one another in terms of their overall attitude toward mental 
hospitals as well as their particular attitudes toward mental hospitals as derived from 
an item analysis of the scale. 


¹This study was performed at the Norfolk (Nebraska) State Hospital. The authors wish to express 
their gratitude to C. G. Ingham, M.D., Hospital Superintendent, for his cooperation. 

METHOD 


Subjects. Six groups of subjects were selected which seemed to have varying degrees 
of familiarity with a mental hospital setting. These were: 

1. Group I, 22 employees on the intensive treatment service of the Norfolk 
State Hospital (NSH). These employees had been exposed to an intensive training 
program attempting to familiarize them with personality theory, psychopathology 
and therapeutic attitudes toward patients on the part of a psychiatric aide. They 
work under the close supervision of psychiatric nurses and psychiatrists. 

2. Group II, 21 clerical and administrative employees of NSH. These people, 
although employed in a mental hospital, have little direct contact with patients and 
are unfamiliar with first-hand scenes of everyday mental hospital life. They have 
been exposed to some training situations, but these have been of a purely theoretical 
nature. 

3. Group III, 39 members of a service organization in the community two 
miles distant from NSH. These subjects are all business or professional men whose 
general socio-economic status is fairly high and whose contacts with mental hospitals 
have been very tangential. 

4. Group IV, 33 in-patients at NSH. Seventeen of these subjects were patients 
on the intensive treatment service of the hospital. Sixteen others were patients from 
chronic wards who were available as subjects because of their participation in group 
psychotherapy meetings being conducted by the authors. Since the statistical com- 
parison failed to reveal any significant differences between these two groups, they 
were combined for further study. 

5. Group V, 63 ward charges at NSH. These subjects were all attendants in 
charge of various wards within the institution, excluding the intensive treatment 
ward. They are thoroughly familiar with the institution and have had patient con- 
tacts over a long period of time. This group consisted of 33 men and 30 women. 
These two sub-groups were combined after statistical comparison failed to reveal 
any significant differences between them. 

6. Group VI, 52 high school seniors in the community. These adolescents had 
no more contact with the mental hospital than the group of businessmen described 
above. 


The scale developed by Souelem consisting of 72 items was administered to 
these various groups with the instructions identical to those used in the original 
study. No opportunity was given the subjects to discuss the scale until after they 
had completed it. As is usual in Thurstone type scales, only the items rated for 
agreement were scored, scale values developed by Souelem being employed for this 
purpose. Each subject was then represented for further statistical study by the mean 
score of the various scorable items in his protocol. 
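
The scoring rule just described (only endorsed items count, and the subject's score is the mean of their scale values) can be sketched as follows. The scale values and the name `attitude_score` are hypothetical and serve only to illustrate the arithmetic; they are not Souelem's published values.

```python
from statistics import mean

# Hypothetical scale values (item number -> scale value); illustrative only.
SCALE_VALUES = {1: 1.2, 2: 5.4, 3: 8.9, 4: 3.1, 5: 7.6}


def attitude_score(endorsed_items, scale_values=SCALE_VALUES):
    """Mean scale value of the items a subject agreed with.

    Items that were not endorsed are ignored, as in Thurstone-type scoring.
    Returns None when the protocol contains no scorable item.
    """
    values = [scale_values[i] for i in endorsed_items if i in scale_values]
    return mean(values) if values else None


# A subject who agreed with items 2, 4 and 5.
score = attitude_score([2, 4, 5])
print(round(score, 2))
```

Under the convention of this study, a higher mean would indicate a more unfavorable attitude.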


RESULTS 


Comparison of Groups on the Total Scale. In Table 1, the various groups are compared 
with one another in terms of the attitudes revealed toward mental hospitals by the 
results. The highest scores represent more unfavorable attitudes toward mental hos- 
pitals, whereas the lower scores represent more favorable ones. The degree to which 
these group differences are significant is revealed in Table 2. Table 2 demonstrates 


TABLE 1. MEANS AND STANDARD DEVIATIONS OF ATTITUDE SCALE SCORES FOR ALL GROUPS 

Factors    Group 1    Group 2    Group 3    Group 4    Group 5    Group 6 
N          22         21         39         33         63         52 
Mean 3.66 3.26 AS 3.§ 5 3.56 
TABLE 2. TESTS OF THE SIGNIFICANCE OF DIFFERENCE BETWEEN THE MEANS OF GROUPS 

* Significant at the .05 level. 


** Significant at the .01 level. 
that the group scores are significantly different from one another in some instances. 
The score of Group I, the employees on the intensive treatment service, does not 
differ significantly from any other group at the .01 level, although it does differ from 
Group II at the .05 level. 

The score of Group II, the clerical employees, differs significantly from Groups 
III, IV, V, and VI, beyond the .01 level. The direction of the differences indicates 
that the attitudes expressed by Group II were more favorable. The score of Group 
IV, the patients, differs significantly from Groups II, III, V, and VI, beyond the .01 
level. The direction of the differences indicates that the attitudes expressed by 
Group IV were more unfavorable. The scores of Groups III, V, and VI, although 
differing significantly from Groups II and IV, as stated above, do not differ from one 
another at the .01 level or beyond. In the above data analysis, Fisher’s table for t was 
used because of the smallness of some of the samples involved. A two-tailed test of 
significance was employed since no prediction was made for the direction of the 
differences. 
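
As a rough illustration of the comparison described above, the following sketch computes a t ratio for two independent groups from summary statistics. It assumes the pooled-variance form of the test, which the text does not specify; the group sizes follow the Subjects section (22 and 21), while the means and standard deviations are hypothetical.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's t for two independent groups from summary statistics.

    Assumes the pooled-variance form with n1 + n2 - 2 degrees of freedom;
    sd1 and sd2 are sample standard deviations (n - 1 denominator).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, df


# Hypothetical means and SDs; only the two group sizes come from the text.
t, df = pooled_t(3.7, 0.6, 22, 3.3, 0.5, 21)
print(round(t, 3), df)
```

The resulting ratio would then be referred to a two-tailed entry in a table of t for the given degrees of freedom.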


Comparison of Groups on Specific Scale Items. It was felt by the authors that addi- 
tional information could be supplied by a comparison of the groups on specific items 
of the scale. For each item, the proportion of each group giving scorable replies to 
the particular item was calculated. The procedure used was the test of significance 
for uncorrelated proportions. Again, Fisher’s table for t was used, and a two-tailed 
test of significance as above. Although differences were calculated for each item, 
only those presenting significant differences are illustrated in Table 3 for the sake of 
conserving space. Items in which differences were found were the following: 
2. Being in a mental hospital is neither good nor bad. 

I would hate to work in a mental hospital. 

Mental hospitals are nauseating. 

Mental hospitals are perfect in every way. 

Many patients won’t want to be in a mental hospital against their will. 

Mental hospitals inspire no definite likes or dislikes in me. 

In mental hospitals, they prevent a person from doing what he wants. 


I don’t like the idea of not being able to go where one wants to go, and to do what one wants 
to do in a mental hospital. 





WALTER G. KLOPFER, ALEXANDER A. WYLIE, JOSEPH S. HILLSON 


It is better for a mentally sick person to be treated at home rather than in a mental hospital. 
I would dislike being forced to go to movies or dances or ball games when in a mental hospital. 
Mental hospitals are frightening. 
Mental hospitals are the most admirable of institutions. 
Both the evils and benefits of mental hospitals are greatly exaggerated. 
The money spent on mental hospitals could be much better spent on schools. 
Mental hospitals are basically immoral. 
Most mental hospitals give patients a feeling of unrest and anxiety. 
You'd think there would be more practical ways of handling patients than mental hospitals. 
The hospital people don’t pay enough attention to individuals and how different everyone is 
from everyone else. 
60. Mental hospitals are so poorly planned that patients hate them. 
62. In a mental hospital they try to get the patients to talk about themselves too much. 
68. One hears so many different ideas about mental hospitals that it is hard to decide whether 
they are good or bad. 
70. In the mental hospitals, patients don’t get any chance to use their own abilities to the best 
advantage. 
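
The item-level comparison described before the list (a test of significance for uncorrelated proportions) can be sketched as below. The pooled standard error is an assumption about the exact formula used, and the counts are hypothetical; only the group sizes (33 patients, 21 clerical employees) come from the Subjects section.

```python
import math


def proportion_difference(x1, n1, x2, n2):
    """Difference between two independent proportions and its critical ratio.

    x1 and x2 are the numbers in each group agreeing with the item.
    The standard error uses the pooled proportion (an assumed convention).
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    diff = p1 - p2
    return diff, se, diff / se


# Hypothetical counts: 20 of 33 patients and 5 of 21 clerical employees agree.
diff, se, ratio = proportion_difference(20, 33, 5, 21)
print(round(diff, 3), round(se, 3), round(ratio, 3))
```

The three quantities returned correspond to a difference between proportions, its standard error, and the ratio that would be referred to a table of t.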


TABLE 3. ITEMS DIFFERENTIATING BETWEEN GROUPS BEYOND THE .01 LEVEL* 

*In order to conserve space, the data reported are for only those items on which there were 
significant differences. 

Proportion of group indicating agreement with item. 

Difference between proportions. 

Standard Error of difference between proportions. 
The items listed above were those which served to distinguish the various 
groups from one another in terms of the specific attitudes involved. Since only the 
most clearcut differences were considered important, only results significant at the 
.01 level are listed in the table that will be discussed below. 


Group I, the employees on the intensive treatment service, seem to consider 
paying attention to individuals of greater importance than almost all other groups. 
They seem more aware of the feeling of unrest and anxiety engendered in patients 
than the clerical employees or the business men. They are less concerned with the 
restrictions that patients have to put up with than the patients themselves. 


Group II, the clerical employees, had generally more favorable attitudes in 
comparison with the patients; they seem to consider mental hospitals less restrictive, 
less frightening, less anxiety provoking, more practical, and more patient oriented. 
They seem less aware of the restrictions than the charge attendants in Group V, and 
consider the hospitals more admirable than the high school students in Group VI. 


Group III, the businessmen, score high on neutral items. They seem less critical 
of restrictions placed on patients than the patients themselves and are unaware of 
anxiety produced by the hospital situation. Their attitudes toward working in 
hospitals seem somewhat negative. 


Group IV, the patients, have generally unfavorable attitudes, especially as 
compared to the clerical employees in Group II. They consider mental hospitals 
frightening and anxiety provoking as well as restrictive. Group V, the ward charges, 
seem to be bothered by restrictions and not aware of the frightening aspects of the 
hospital and consider it rather admirable. Group VI, the high school students, are in 
every way quite similar to the business men, having generally neutral attitudes. 


SUMMARY AND CONCLUSIONS 


This study has demonstrated that Souelem’s scale for the measurement of atti- 
tudes toward mental hospitals can distinguish various groups from one another. 
Among the groups studied the clerical employees tended to be most uncritical toward 
mental hospitals and the patients were the most critical. The employees on the 
intensive treatment service are more aware of the need of paying attention to in- 
dividual differences, whereas the ward charges are mainly concerned about the 
restriction of freedom. It is interesting to note that the two “outside” groups have 
generally more neutral attitudes but are also critical to some degree. The business- 
men and high school students seem to regard the mental hospital as an undesirable 
place of employment, and a place which is frightening to the “inmates”. The sim- 
ilarity of these two groups suggests that the stereotype of a “general public” which is 
homogeneous in its attitudes has some validity. 

In conclusion, it is the feeling of the present authors that this kind of an approach 
to the study of attitudes toward mental hospitals is promising and that further work 
could profitably be carried on along these lines. 
INTRODUCTION 

The House-Tree-Person Projective Technique (later referred to as H-T-P) 
includes quantitative and qualitative psychological evaluation of free-hand drawings. 
The quantitative evaluation, which essentially distinguishes this technique from 
other similar drawing techniques, such as The Drawing of the Human Figure or 
Drawing Completion, provides for a scoring system and numerical evaluation of 
the performance. This quantitative scoring system devised by Buck was standard- 
ized on a “carefully selected sample of 120 adults of six intelligence levels (imbecile, 
moron, borderline, dull average, average, and above average)”. A scoring 
manual gives detailed directions for scoring and includes illustrative samples of 
scorable points in each item (House, Tree, Person); it lists 84 titles for the items to be 
scored, which in turn include many subtitles, increasing the number of scor- 
able points to well over one hundred. Buck’s scoring differs from Goodenough’s 
scoring through the provision of flaw scores for missing items along with scores for 
present items. Four IQ scores are obtained as the final result of the quantitative eval- 
uation, which may serve as a rough measure of intelligence (p. 48); but more than 
that, it is proposed to utilize these IQ scores as an objective basis for qualitative 
analysis. 

Comparisons between H-T-P IQ’s and other conventional intelligence tests, such 
as W-B Intelligence Scale, Stanford-Binet, and Otis have been made as either part of 
the original validation studies or as attempts to cross-validate some aspects 
of the technique. The findings in these studies show considerable variations, 
with correlation coefficients ranging from .165 to .412 and from .41 to .746. 
Several plausible explanations were suggested to account for such differences. The 
question, however, as to how much the results may be influenced by individual 
differences on the part of the scorers appears to be legitimate. While McCarthy 
found interscorer correlations among three scorers of .87, .90, and .92 on the Gooden- 
ough Draw-A-Man Test, a review of the literature did not show that an empirical 
scrutiny of this question has been undertaken on the H-T-P. Therefore, the present 
study dealing with scorer’s reliability in H-T-P scoring has been designed as an 
attempt to investigate to what extent subjective elements may influence the scoring. 
It is a part of the H-T-P cross-validation studies proposed by this investigator in 
1953. The problem of this study was to determine to what extent different scorers 
agree in their results on quantitative scoring of the H-T-P drawings, or in other 
words, to what extent the scorers are reliable. 


PROCEDURE 


A set of drawings was obtained in administering the H-T-P to a group of 43 college 
students who were enrolled in Mental Hygiene, Introductory Psychology, and Ex- 
perimental Psychology classes at the Richmond Professional Institute of the College 
of William and Mary. The group itself was rather heterogeneous; almost every 
college major being represented. It consisted of 27 females and 16 males. They 
ranged in age from 17 to 35 with a mean age of 21.7 and a standard deviation of 5.19. 

Three judges agreed to score the obtained sample of drawings according to the 
directions listed in the H-T-P Manual. Persons serving as judges had one or more 
years of graduate study in clinical psychology; they were familiar with the H-T-P 


*The author wishes to thank Mr. John ei Digiammo for his help in collecting the data and serving 
as a judge, Mr. Robert A. Toal for serving as a judge, and Mr. Edwin R. Thomas for assistance in 
statistical treatment. 
technique and had been using it as a part of a diagnostic battery for at least one year 
prior to scoring this sample. The judges were aware that their scores would be com- 
pared for research purposes. Each judge computed the scores and converted them 
into IQ’s using the conversion table as printed in the H-T-P Scoring Folder. 

Pearson’s product moment correlations were computed. The z Test of Differ- 
ences between r’s was applied and t ratios on these differences calculated using the 
technique suggested by Guilford. Conversion of Pearson r’s into z coefficients 
followed the method devised by Fisher. 
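
The conversion and test named above can be sketched as follows. Fisher's transformation is z = arctanh(r); the ratio below divides the difference between two z coefficients by the standard error sqrt(1/(n1 - 3) + 1/(n2 - 3)), which treats the two correlations as if they came from independent samples. That treatment is an assumption made here for illustration. The two r values are the extremes of the range reported later in this paper (.68 and .90), paired only as an example.

```python
import math


def fisher_z(r):
    """Fisher's r-to-z transformation, z = arctanh(r)."""
    return math.atanh(r)


def z_difference_ratio(r1, n1, r2, n2):
    """Ratio of the difference between two Fisher z's to its standard error.

    Assumes the two correlations are independent (a simplification).
    """
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return (fisher_z(r1) - fisher_z(r2)) / se, se


# Both correlations are based on the same 43 sets of drawings.
ratio, se = z_difference_ratio(0.90, 43, 0.68, 43)
print(round(se, 3), round(ratio, 3))
```

With N = 43 for both coefficients the standard error of the difference is about .224.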


RESULTS 
The Pearsonian r’s on the agreement between the judges as presented in Table 1 
and the significance of differences between the r’s as presented on Tables 2 and 3 
suggest a characteristic trend. There is an obvious discrepancy between vertical 
and horizontal comparisons of the r’s. In observing Table 1, we notice that the 


TABLE 1. PEARSONIAN r’s BETWEEN THE JUDGES (A, B, C) ON THE FOUR 
H-T-P IQ SCORES 

Judges    Per cent Raw G    Net Weighted Score    Good Score    Flaw Score 
                r                   r                 r             r 
A-B                                .85               .90           .68 
A-C                                .86               .86           .72 
B-C                                .89               .88           .70 





TABLE 2. t RATIOS ON THE z TEST OF DIFFERENCES BETWEEN r’s OF DIFFERENT JUDGES ON THE 
SAME FOUR H-T-P IQ CATEGORIES (VERTICAL COMPARISON) 

Judges    Per cent Raw G    Net Weighted     Good Score      Flaw Score 
           z       t         z       t        z       t       z      t 
A-B       1.10              1.26             1.47             .83 
A-C        .97              1.29    .135     1.29    .807     .91    .359 

A-C        .97              1.29             1.29             .91 
B-C        .87     .44      1.42    .583     1.38    .404     .87    .179 

A-B       1.10              1.26             1.47             .83 
B-C        .87    1.031     1.42             1.38    .404     .87    .179 

N = 43; Degrees of freedom 41 

σdz = .223 


TABLE 3. t RATIOS ON THE z TEST OF DIFFERENCES BETWEEN r’s OF THE SAME JUDGES ON DIFFERENT 
H-T-P IQ CATEGORIES (HORIZONTAL COMPARISON) 

Factors            A-B             A-C             B-C 
                  z      t        z      t        z      t 
Raw G            1.10                             .87 
Net Weighted     1.26                            1.42    2.466** 

Raw G            1.10                             .87 
Good Score       1.47                            1.38    2.287*** 

Raw G            1.10                             .87 
Flaw Score        .83                             .87     .000 

Net Weighted     1.26                            1.42 
Good Score       1.47    .94                     1.38     .179 

Net Weighted     1.26                            1.42 
Flaw Score        .83                             .87    2.466** 

Good Score       1.47                            1.38 
Flaw Score        .83                             .87    2.267*** 

*Significant at 0.01 level 
**Significant at 0.02 level 
***Significant at 0.05 level 
TABLE 4. MEANS AND STANDARD DEVIATIONS OF JUDGES’ SCORES ON DIFFERENT H-T-P IQ 
CATEGORIES (N = 43). 

Judges    Per cent Raw G    Net Weighted    Good Score    Flaw Score 
            M     S.D.        M     S.D.      M    S.D.     M    S.D. 
109.37 10.19 106.7 ‘ ‘ 
109.41 12.96 110.14 107 .60 
108.18 11.81 107.7 12.82 111.83 


judges’ agreements tend to be quite close if we compare the correlation coefficients 
obtained in comparing different judges on the same H-T-P IQ categories. Table 2 
provides additional support to this observation, showing the t ratio on the differ- 
ences between r’s is not statistically significant. It seems that the judges are in- 
clined to agree or disagree about equally if we compare their scores on the same 
H-T-P IQ. 

However, if we compare the r’s on the agreement among the same judges on 
different H-T-P categories, we find some more pronounced differences. Table 1 
shows that r’s on Per cent Raw G IQ and on Flaw Score IQ are obviously lower than they 
are on either Net Weighted Score IQ or Good Score IQ. This observation is sup- 
ported by the t ratios on the z test of differences between r’s as shown in Table 3. 
The t ratios on these differences tend to be much higher where the r’s on either Raw 
G IQ or on the Flaw Score IQ are included in the comparison. So we have in this case 
t’s significant at .01, .02, and .05 levels. The least significant differences among the 
judges appear when we compare them on the Net Weighted IQ’s vs. Good 
Score IQ’s and on the Raw G IQ’s vs. Flaw Score IQ’s. Of course, the findings listed 
in Table 2 indicate that low t ratios have different meanings in the two instances, i.e., in 
one instance there is a consistently high agreement among judges on those two IQ 
scores, while in the other case, their agreement is consistently lower. 

The Pearsonian r’s for the agreement among the judges show a variation from 
.68 to .90. If we agree with the generally accepted theory advanced by Kelley 
that the r should be .94 or above in order that the results be valid for individual pre- 
diction, our results on the Flaw Score IQ and the Per cent Raw G IQ would suggest 
that these scores may be questioned as to their value for individual predictions. Of 
course, the r’s on all IQ’s found in this study are above .50, and therefore, they all 
could be considered of value for group predictions. 
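
The two thresholds cited in this paragraph (.94 or above for individual prediction, above .50 for group prediction) amount to a simple decision rule, sketched below. The function name and the returned labels are illustrative only.

```python
INDIVIDUAL_MIN = 0.94  # r at or above this: usable for individual prediction
GROUP_MIN = 0.50       # r above this: usable for group prediction


def prediction_use(r):
    """Classify a reliability coefficient by the two thresholds in the text."""
    if r >= INDIVIDUAL_MIN:
        return "individual"
    if r > GROUP_MIN:
        return "group"
    return "neither"


# The lowest and highest interscorer correlations reported in this study.
for r in (0.68, 0.90):
    print(r, prediction_use(r))
```

Both ends of the reported range fall in the group-prediction band under this rule.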

The agreement among the scorers on the Good Score IQ’s and Net Weighted 
Score IQ’s compares well with the McCarthy findings on interscorer correlations on 
Goodenough Draw-A-Man Test mentioned previously. However, the differences 
found on other IQ scores may raise questions as to the applicability of some parts of 
the H-T-P technique for individual prediction. Although we don’t know the reasons 
for the discrepancies among the judges, we are inclined to speculate that they either 
interpreted the directions of the manual subjectively or that they did not understand 
the directions the same way. In any instance, it appears that more refinement may 
be needed to make some parts of the H-T-P scoring less dependent upon the scorer’s 
interpretation. 


SUMMARY AND CONCLUSIONS 


A set of H-T-P drawings obtained from 43 college students was scored in- 
dependently by three judges. Pearson product moment correlations were com- 
puted to determine the reliability of the scorers. It was found that the r’s on the 
Flaw Score IQ and Per cent Raw G IQ were lower than necessary for individual pre- 
diction of a single judge. However, all r’s are sufficient for group prediction. The t 
ratio obtained in using Fisher’s z Test was found to be negligible when the r’s of 
different judges on the same IQ scores were compared, suggesting that there is a 
definite trend among the judges to agree or disagree the same way on the same H-T-P 
IQ’s. However, the t ratios on the differences between r’s of the same judges on 
different H-T-P IQ scores were found to be significant in some instances. In view of 
this, it was assumed that certain parts of the scoring may be presenting difficulties 
to the scorer. The suggestion was made that a possible need may exist to refine some 
parts of the H-T-P scoring system or the directions in the manual, particularly those 
dealing with the Raw G and the Flaw Scores. 
SEXUAL DIFFERENTIATION IN FIGURE DRAWINGS 
AND OVERT DEVIATION 


FRED CUTTER 
Atascadero (Calif.) State Hospital 


PROBLEM 


The degree of sexual differentiation in the drawing of male and female figures 
has been used as a qualitative measure of sexual identification. Recently, a 
method has been devised for rating such differentiation on a nine point scale. 
While the methodology and reliability appear adequate, evidence is still sparse on 
the meaning of such measures. It has been suggested that poor sexual differentiation 
of the figure drawings may reflect general deficits or specific sexual disturbances. 
This paper describes an attempt to verify these hypotheses. 


METHOD 


The experimental subjects were male patients at the Atascadero State Hospital 
committed as sexual psychopaths; i.e., persons convicted of sexual offenses. There 
were two groups of patients: the first group consisted of 108 sexual psychopaths on 
observation (“SPO”); the second, a therapy group consisting of 59 sexual psycho- 
paths on an indeterminate commitment. Twenty-two psychiatric technician 
trainees made up a control group. They were carefully selected for their positions 
and had no known psychiatric or sexual disturbances. 

The therapy group was divided into overt and covert deviation sub-groups. 
The overt sub-group (“O”) consisted of 22 patients who frequently practiced some 
form of sexual deviation without regret or anxiety, and acknowledged this behavior 
when confronted. The covert sub-group (“C”) was made up of 37 patients who, 
while guilty of at least one offense, were unable to accept such deviant behavior as 
derived from their own motivational patterns. Instead, these patients shifted 
responsibility for their actions to the victim or to alcohol, denied the deviation entirely, 
or protested it could never happen again. The classification of overt and covert was 
made without reference to the figure drawings by the patients’ therapist, the author 
of this paper. 

The “SPO” group was tested individually as part of the routine observation 
examinations. Among the tests administered were the DAP and Wechsler Verbal 
Scale. The therapy and control groups were given the DAP in a group administra- 
tion according to the usual directions. The therapy patients were tested in six 
groups of approximately ten each. In addition, the 59 therapy patients were also 
given the group form of the MMPI. 


RESULTS 


The test-retest method was used to estimate the reliability in rating the figure 
drawings. Inter-rater reliability was estimated by inter-correlating the author’s and 
a junior psychologist’s ratings of the same figures.* The three measures of reliability 
derived from this study are summarized in Table 1 together with the reliability re- 
ported in the use of this rating method. 


TABLE 1. RELIABILITY COEFFICIENTS OBTAINED IN THE RATINGS OF SEXUAL DIFFERENTIATION IN 
HUMAN FIGURE DRAWINGS. 

Measures                    N     Coefficient    Source 
Inter-rater                 58    .84            Swenson’s neurotics 
Rate-rerate (2 days)              .90            Sex psychopath observation 
Rate-rerate (2 months)            .89            Therapy sex psychopaths 
Inter-rater                       .78            Therapy sex psychopaths 





The distributions of ratings in the various groups investigated are described in 
Table 2. This also includes the ratings of the in-patients and out-patients reported 
in the original study. The ratings for the out-patient, “SPO”, “C”, and control 
groups do not differ significantly from each other. The chi square was 4.00 with 
six degrees of freedom. 


TABLE 2. DISTRIBUTION OF RATINGS OF SEXUAL DIFFERENTIATION IN THE 
DAP BY A CONTROL AND FIVE PATIENT GROUPS. 

                RATINGS 
Patients        4-6 
In-patients 
Out-patients 
“SPO” 
“O” 
“C” 
Control 








The distribution of in-patient ratings was significantly different from those of the 
remaining five groups. The only other significant difference was that between the 
“O” and the out-patient groups, with an obtained chi square of 11.02 at 2 degrees of freedom 
(significant at the 1% level). 
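
A chi-square comparison of rating distributions of the kind reported above can be sketched as follows. The frequency table is hypothetical (two groups by three rating bands), since the cell counts are not recoverable here; the closing line uses the fact that, for 2 degrees of freedom only, the upper-tail probability of chi square equals exp(-x/2).

```python
import math


def chi_square(table):
    """Pearson chi square and degrees of freedom for a frequency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


def p_value_df2(stat):
    """Upper-tail probability of chi square; valid for 2 degrees of freedom only."""
    return math.exp(-stat / 2)


# Hypothetical counts: rows are two groups, columns are three rating bands.
stat, df = chi_square([[4, 10, 8], [12, 9, 3]])
print(round(stat, 2), df, round(p_value_df2(stat), 4))

# The reported value of 11.02 at 2 degrees of freedom.
print(round(p_value_df2(11.02), 4))
```

The last line is consistent with the statement that 11.02 at 2 degrees of freedom is significant at the 1% level.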


*The author is indebted to Mrs. Edith Moreledge, M. A. for her assistance in rating the DAP of 
the therapy patients. 
The correlations of ratings of sexual differentiation with verbal IQ (Wech- 
sler) and with age were significant, uncorrected for attenuation. The product moment 
correlation coefficients in the 108 “SPO” patients were +.206 with IQ and —.268 
with age. 
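
The text does not say which significance test was applied to these coefficients. Assuming the conventional t test of a product-moment correlation against zero, with N - 2 degrees of freedom, the two values can be checked with the short Python sketch below; the function name is an illustrative choice.

```python
import math


def t_for_correlation(r: float, n: int) -> float:
    """t statistic for testing a product-moment correlation r against zero.

    Uses t = r * sqrt(n - 2) / sqrt(1 - r^2), with n - 2 degrees of freedom.
    """
    if n < 3 or not -1.0 < r < 1.0:
        raise ValueError("need n >= 3 and -1 < r < 1")
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


n_patients = 108  # the "SPO" group
t_iq = t_for_correlation(0.206, n_patients)
t_age = t_for_correlation(-0.268, n_patients)
print(f"IQ:  t = {t_iq:.2f}")
print(f"Age: t = {t_age:.2f}")
```

Under this assumption both statistics exceed, in absolute value, the two-tailed .05 critical value of roughly 1.98 for 106 degrees of freedom.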

Analyses of variance of the ratings are reported in Table 3 by (a) type of de- 
viation, and (b) by syndromes. Part A is based on the ratings of 108 “SPO” cases 
and is not significant. The subjects in part B were 50 of the 59 therapy cases that 
could be classified into the three categories shown. The results here are significant 
and indicate that the better ratings are obtained by well integrated personalities in 
which overt sexual deviation is a primary symptom. 


TABLE 3. ANALYSES OF VARIANCE OF RATINGS OF SEXUAL DIFFERENTIATION IN FIGURE DRAWINGS.


A: BY FIVE CATEGORIES OF SEXUAL DEVIATION

Deviation                 Mean     Source     Sums of squares    D. F.

Homosexual                5.68     Total      548                107
Overt Child Molester               Between    15                 4
Covert Child Molester     4.7      Within     533                103
Exhibitionist
Rapist

F = 1.38, not significant


B: BY THREE CLINICAL SYNDROMES

Syndrome                       N     Mean     Sums of squares

Overt sex deviation                  ; 86     Total
Neurotics                            5.26     Between
Personality disorganization          4.24     Within

Total                          50             F = 5.342








The 13 MMPI variables were correlated with ratings of sexual differentiation
for 55 of the therapy patients; four patients were eliminated because of high F
scores. Only two of the scales correlated significantly, Mf and F. The coefficients
were +.525 and −.296 respectively for raw scores, with a standard error of .136.
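
The reported standard error is consistent with the large-sample formula 1/sqrt(N - 1) for a correlation under the null hypothesis, although the text does not state which formula was used. The Python sketch below works under that assumption; the negative sign on the F coefficient follows the Discussion, and the variable names are illustrative.

```python
import math

n_patients = 55  # therapy patients retained after exclusions

# Assumed formula: standard error of r under the null hypothesis of zero correlation.
standard_error = 1.0 / math.sqrt(n_patients - 1)

coefficients = {"Mf": 0.525, "F": -0.296}

# Ratio of each coefficient to its standard error (a critical-ratio style check).
critical_ratios = {scale: r / standard_error for scale, r in coefficients.items()}

print(f"standard error = {standard_error:.3f}")
for scale, ratio in critical_ratios.items():
    print(f"{scale}: r / SE = {ratio:.2f}")
```

Both ratios exceed 1.96 in absolute value, which matches the statement that only these two scales correlated significantly.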


DISCUSSION 


The similarity of the distributions of ratings by the out-patient, “SPO”, “O”
and control groups contradicts the hypothesis that sexual differentiation on the DAP 
measures sexual disturbances. On the other hand, the discrepancy between the dis- 
tributions of ratings by the in-patients and the remaining groups supports the general 
deficits hypothesis. An unexpected finding is the reversed distributions of ratings 
between the out-patients and the “O” groups. Contrary to prevailing clinical opin-
ion, the more overt the sexual deviation the better the differentiation between the
drawn figures. This finding is also supported by the lack of association between 
sexual deviation categories and distribution of ratings (see Table 3). 

The significant correlation of age and IQ with ratings of sexual differentiation 
relates this variable to the efficiency of cognitive processes. The negative correlation 
with age appears to be due to a large proportion of elderly child molesters in the 
“SPO” group, whose deviant behavior is incidental to a general deterioration found 
in the presenile patients. The positive association of measured intelligence and 
ratings can be related in terms of an absence of impairments such as anxiety, i.e., to 
confirm the general deficits hypothesis. 

The negative correlation with the F scale also supports the integration hy-
pothesis. A high F score, where scoring errors are eliminated, is shown by patients
who are “... unable to comprehend the item ...” or “... as a wish to appear in an
unfavorable light which may be either deliberate or unconscious ...”@). Both
possibilities occur in the mental disturbances with serious personality disorganiza-
tions.

High Mf scores reflect a femininity of interest patterns in males that is usually 
found in male homosexuals“. Since the personality of homosexuals is generally re- 
garded as well integrated, albeit in a deviant pattern, the femininity of interests can 
be said to reflect one aspect of such an integration. Its positive association with 
ratings of sexual differentiation can then be taken to indicate a common underlying 
factor of personality integration for both variables.

Some support for the association of sexual differentiation on the DAP and per- 
sonality integration comes from a recent study “? where human figure drawings were 
used as a measure of recovery from “psychotic regression’. The author reported in- 
creased body image and sexual maturation (as judged by greater detail and organ- 
ization of the drawings) with improvement in mental status. 


CONCLUSIONS 


1. Ratings of sexual differentiation in human figure drawings provide a re- 
liable measure of human behavior. 


2. In terms of gross clinical syndromes, patients with overt sexual deviation 
(homosexuality, exhibitionism) received the best ratings. Patients with severe 
personality disorganizations (alcoholics, psychotics in remission, etc.) received the 
poorest ratings. Neurotic patients received intermediate ratings. 


3. Verbal IQ and age are associated with the ratings in terms of cognitive 
efficiency. The freer the patient is from impairment of thought, the better are his 
ratings of sexual differentiation. 


4. The MMPI scales Mf and F are associated with ratings in terms of degree of 
personality integration. The greater the degree of personality integration (as re- 
flected by high Mf and low F) the better the differentiation of the male and female 
figures. 


5. It is concluded that ratings of sexual differentiation on the DAP are a 
measure of general psychological deficits associated with the degree of personality 
integration. 
WITHDRAWAL AS A DIMENSION OF SCHIZOPHRENIA:
AN EXPLORATORY STUDY 


GERALD F. KING¹


Veterans Administration Hospital, Battle Creek, Michigan 


PROBLEM 


Theory and research in the clinical-personality area are basically concerned 
with functional unities in human behavior. A variety of approaches have been util- 
ized in empirical investigations, ranging from the global statistical analysis of many 
representative variables“: *) to the consideration of the generalized nature of a single 
variable“ ¢°). The present article is concerned with what classical theory has as- 
sumed to be a unitary dimension, namely, withdrawal, usually associated with 
schizophrenic behavior and also referred to as ‘environmental withdrawal” or ‘‘with- 
drawal from reality”. Withdrawal can be viewed as behavior directed away from at 
least two types of environmental stimuli, people and things. On a conceptual level, 
the behavioral processes corresponding to this environmental classification could be 
labeled ‘‘interpersonal-withdrawal” and ‘‘thing-withdrawal’’. It would follow that 
the general dimension of withdrawal would not be unitary if the components, inter-
personal- and thing-withdrawal, were independent variables. The purpose of this
study was to explore the behavioral unity of withdrawal using the foregoing orienta- 
tion. 


METHOD 


The patient population of a small locked ward in a VA neuropsychiatric hospital 
served as the subjects. The ward was designated for young, acute schizophrenics 
recently admitted to the hospital. A variety of symptoms and behavioral patterns 
were displayed by the patients, and a wide range in terms of severity of illness was 
represented. An attempt was made to devise separate measures of interpersonal-
and thing-withdrawal based on the different hospital activities scheduled for the
ward. A total of three measures were obtained in this way, and a fourth was provided 
by another study being conducted on the ward. As can be seen in the following brief 
descriptions, two of the measures referred to responsiveness towards people and two 
referred to responsiveness towards things. 


Interpersonal-withdrawal, group therapy (IW-T). The group therapist was given the following
instructions: “Rank the patients on their responsiveness to the group. Base your judgments on
amount of participation, whether it is constructive or not.”

Interpersonal-withdrawal, ward observations (IW-O). The charge-attendant on the ward was
given the following instructions: “On the basis of your observations of the patients in the day-
room, rank them on how much they socialize with other patients. Use as a guide the number and
duration of the social contacts.”


Thing-withdrawal, occupational therapy (TW-T). The occupational therapy clinic had a
variety of projects that the patients could undertake (e.g., leather craft, weaving, painting,
plastics). The occupational therapist was instructed: “Rank the patients on the amount of inter-
est and energy they display in regard to the various materials and projects. Do not take into
account their responsiveness to therapists, attendants, fellow patients, or other people in the
clinic.”


Thing-withdrawal, operant conditioning (TW-C). The operant conditioning apparatus has 
been described in detail elsewhere®). Projecting from the center of a panel erected across one 
end of a small room was a wooden knob which operated like a plunger in a vending machine. By 
pulling the knob, the patient could obtain rewards of candy and cigarettes which rolled onto a 
tray below the knob. After each of the first three pulls, the patient received a reward, but, there- 
after, the rewards gradually decreased until the patient ended up on a 40 to 1 fixed-ratio schedule. 
1The author is indebted to Drs. David W. Merrell and Norman Graff for their interest and 
assistance. 

2As an arbitrary designation for this behavioral process, thing-withdrawal does not appear to be 
entirely satisfactory. Such terms as non-interpersonal-withdrawal, objectual-withdrawal, and psycho- 
motor-task-withdrawal were also considered. 
Each patient was given three half-hour trials on three consecutive days, with the operant rate
being established as the mean number of pulls for the three trials. The patients were then ranked 
in accordance with their operant rate. 
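
The scoring step described above (mean pulls over three trials, then ranking) can be sketched in Python as follows. The pull counts and patient labels are hypothetical and are not data from the study; assigning rank 1 to the highest operant rate is also an assumption, since the direction of ranking is not stated, and ties are not handled.

```python
from statistics import mean

# Hypothetical pull counts for three half-hour trials per patient.
pulls_by_patient = {
    "P1": [120, 150, 135],
    "P2": [40, 35, 30],
    "P3": [300, 280, 310],
}


def rank_by_operant_rate(trials):
    """Return (ranks, rates): rates are mean pulls per trial; rank 1 is the highest rate."""
    rates = {patient: mean(counts) for patient, counts in trials.items()}
    ordered = sorted(rates, key=rates.get, reverse=True)
    ranks = {patient: position for position, patient in enumerate(ordered, start=1)}
    return ranks, rates


ranks, rates = rank_by_operant_rate(pulls_by_patient)
for patient in sorted(ranks, key=ranks.get):
    print(patient, ranks[patient], rates[patient])
```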


It was not possible to use the entire ward as a single group in terms of measure- 
ment (ranking) since the patients participated in the activity schedule in smaller 
groups. Measurements were obtained for two small samples from the ward. Sample 
A consisted of 11 subjects and Sample B, 13 subjects. The first three measures of 
withdrawal (IW-T, IW-O, and TW-T) were based on observations over a period of 
two months. In every case, the subjects were ranked independently by two judges, 
which provided an estimate of inter-rater reliability. The correlations (rhos) be- 
tween raters were all .80 or higher. The retest reliability of the TW-C measure could 
be computed since the operant conditioning procedure was repeated two or three 
months later. The rhos were above .85 for both Samples A and B. 
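
For rankings without ties, the rank correlation (rho) used for these reliability estimates can be computed from the squared rank differences. The Python sketch below uses two hypothetical sets of ranks for an 11-patient sample; the data and names are illustrative only.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank correlation for two untied rankings of the same subjects.

    rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), where d is the rank difference.
    """
    n = len(ranks_a)
    if n != len(ranks_b) or n < 2:
        raise ValueError("rankings must have equal length of at least 2")
    sum_d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d_squared / (n * (n * n - 1))


# Hypothetical rankings of 11 patients by two independent judges.
judge_1 = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
judge_2 = [2, 1, 3, 5, 4, 6, 7, 9, 8, 10, 11]

rho = spearman_rho(judge_1, judge_2)
print(f"rho = {rho:.3f}")
```

The formula is exact only when no ranks are tied; tied ranks would call for a corrected formula or a product-moment correlation computed on the ranks.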


RESULTS AND DISCUSSION

Tables 1 and 2 present the intercorrelations between the measures of with- 
drawal in Samples A and B. The two measures of interpersonal-withdrawal are 
significantly related, as are the two measures of thing-withdrawal. No relationship 
is indicated between interpersonal- and thing-withdrawal. The findings are con-
sistent from sample to sample. Withdrawal behavior in schizophrenia would appear
to be specific to the classification of environmental referents, whether people or 
things. Thus, the data suggest that withdrawal, rather than being a unitary dimen- 
sion, consists of two independent factors. 


TABLE 1. INTERCORRELATIONS BETWEEN THE MEASURES OF INTERPER-
SONAL-WITHDRAWAL AND THING-WITHDRAWAL IN SAMPLE A (N = 11)


        IW-T    IW-O
IW-T    --      .72**
IW-O            --
TW-T

*Significant at the .05 level of confidence. 
**Significant at the .02 level of confidence. 

Note: The correlations are represented by rhos, with the tests of signi- 
ficance being in accordance with Du Mas®). 


TABLE 2. INTERCORRELATIONS BETWEEN THE MEASURES OF INTERPER-
SONAL-WITHDRAWAL AND THING-WITHDRAWAL IN SAMPLE B (N = 13)


        TW-T    TW-C
IW-T    .21     −.07
IW-O    −.05    .29
TW-T    --      .59*





*Significant at the .05 level of confidence. 
**Significant at the .01 level of confidence. 


Seemingly compatible with the findings of this study are selected observations 
that can readily be made in a neuropsychiatric hospital. There is the schizophrenic 
patient (usually paranoid) whom the ward personnel are unable to interest in any 
activity. The patient prefers just to sit on the ward rather than go to the gymnasium, 
hobby shop, or other hospital activities. At the same time, he constantly gives ad- 
vice to other patients and bothers the ward personnel with various requests. He 
immediately corners any new face that appears on the ward. Here is an example 
where there is no obvious withdrawal in the interpersonal sphere, but a definite lack 
of responsiveness is present in what has been termed the “thing” sphere. Another 
schizophrenic patient works faithfully and industriously in the hospital laundry. 
When not working, however, he sits alone on the ward, avoiding interpersonal con-
tact with both ward personnel and other patients. In this case, the interpersonal-
withdrawal is quite apparent, but there is no indication of thing-withdrawal. Ob- 
servations of this nature support the findings of this study in suggesting that inter- 
personal-withdrawal and thing-withdrawal are not concomitant processes. 

As an exploratory study, the results are offered as tentative. They seem to be 
sufficiently unequivocal, however, to warrant further investigation in this area with 
larger samples and more precise measures. A current development in psychopathol- 
ogy is that of treating diagnosis as a multi-dimensional concept“). If the distinction 
between interpersonal-withdrawal and thing-withdrawal were confirmed in future 
research, a fragmented, general dimension could be replaced by two variables more 
unitary in nature, which would allow greater precision in the description of psycho- 
pathology. It seems axiomatic to say that greater precision in measurement is an 
antecedent condition for better prediction. 


SUMMARY 


Withdrawal, as a dimension in schizophrenia, was described as behavior directed 
away from two types of environmental stimuli, people and things. The behavioral 
processes corresponding to this environmental classification were designated as 
“interpersonal-withdrawal” and “thing-withdrawal”. The relationship between
interpersonal- and thing-withdrawal was investigated in order to make an explora-
tory test of the behavioral unity of withdrawal. A total of four measures of with-
drawal were devised, two referring to responsiveness towards people and two re-
ferring to responsiveness towards things. Although the two measures of interpersonal-
withdrawal were found to be significantly related, as were the two measures of
thing-withdrawal, no relationship was obtained between interpersonal- and thing-
withdrawal. The results were interpreted as offering evidence that withdrawal,
instead of being a unitary dimension, may consist of at least two independent factors.
PREDICTING PSYCHIATRIC DIAGNOSES WITH THE MMPI


PROBLEM


This paper is concerned with the following problem: Can MMPI profiles be 
used to predict psychiatric diagnosis? If so, is clinical training a necessary prerequis- 
ite in such prediction? How much increase in prediction results from a consideration 
of the profiles as a whole (as judges are likely to do) in contrast with simply using the 
highest abnormal scale score as the only cue in the decision making process? Further, 
do experienced judges show greater agreement than those who have no experience 
with the MMPI or in clinical psychology? 


PROCEDURE 


We obtained our sample of 35 MMPI profiles from Borko’s study“? in which he 
reports the MMPI scores of 34 hospitalized mental patients and their psychiatric 
diagnoses. The 34 profiles included eight patients diagnosed paranoid schizophrenic, 
three unclassified schizophrenics, six manic depressives, six character disorders, five 
anxiety neurotics, three depressive neurotics, and three hypochondriacal neurotics.
The patients were all white, male veterans whose Shipley-Hartford IQ ratings varied
from 90 to 120. Case number 35 was fictional and consisted of the scores obtained 
according to the general key or ‘‘normal’’ way of classifying the statements. A group 
form of the MMPI had been used which included only the 373 items which are scored 
in any of the standard scales. Three diagnoses were available on each patient which 
were obtained at different time sequences of the patient’s hospitalization. The major 
criterion employed for the selection of this sample was complete agreement among 
the three or more psychiatric diagnoses which had been made on each patient upon 
initial or intake interview, during hospitalization, and upon final or terminal inter-
view »- ©). From a methodological viewpoint for purposes of this study, it would
have been best had the MMPI results not been made known to the psychiatrists, 
thus avoiding any possible degree of contamination between predictor and criterion. 
According to Borko’s description: ”- *), it appears that six of the cases came from 
a research project whose criterion for selection involved three independent diagnoses 
made by a psychiatrist who had no knowledge of the MMPI results. For these cases 
the possibility of contamination between the MMPI and the diagnosis appears to 
be at a minimum. For the remaining twenty-eight cases the possibility that one or all
of the psychiatric diagnoses was made with knowledge of the MMPI results remains 
open, although it appears likely that the MMPI results would be more apt to in- 
fluence the psychologist’s diagnosis of the patient than the psychiatrist’s. Whatever 
the case, this limitation of the study should be kept in mind.? 

The first step was to plot the scores on an MMPI profile sheet and to identify 
each case with a number. When this was done, the eight categories into which the 
cases fell were typed on a sheet of paper and each judge was handed a folder con- 
taining the 35 MMPI profiles, the sheet of paper containing the eight categories and 
a set of instructions which read as follows: 


“INSTRUCTIONS TO THE JUDGE: We would like to have you sort each of 
the following 35 MMPI profiles into one of the eight diagnostic categories 


1The authors wish to express their appreciation to Dr. Albert Kubalo and Mr. Bart Cobb for their
statistical aid, to Dr. Saul B. Sells for his comments and suggestions, and to Lt. John Sussenberger for 
compiling the data into some semblance of order. 

2In a personal communication Borko states that for the six cases in the clear-cut study, the MMPI
was not used in the diagnosis. For the other twenty-eight cases he relates it is impossible to determine 
to what extent the diagnosis was influenced by the MMPI result; although knowing the men involved 
and the installation from which the cases were selected, he believes that the influence was minimal. 
presented below. Simply indicate your decision by writing in the appropriate
place the number which appears in the right-hand corner of the profile. Feel 
free to use any cues which are available to you in coming to your decision. For 
your information one of the profiles has been scored according to the ‘‘normal”’ 
key. Also it should be mentioned that the cases are not necessarily equally or 
nearly equally divided among the eight categories presented.” 


The only difference in the problem task confronting the judges was that the in- 
experienced judges were also permitted to use the MMPI manual (which gives a 
description of the MMPI scales) as an aid in the decision-making process. All of 
the judges were known by the writers and believed to be of at least superior intelli-
gence. The experienced group of nine judges included those who had clinical ex-
perience, those who had previously worked with the MMPI, or both. The inexper- 
ienced group had had no clinical experience or training and no familiarity with the 
MMPI. The experienced judges included two psychiatrists, one of whom is board- 
certified; four clinical psychologists; and three research and development officers, 
all of whom have had some clinical experience as well as having used the MMPI for 
research purposes. The six inexperienced judges included one physiological psycho- 
logist, two psychometricians, two college educated airmen, and one chief clerk. 


RESULTS AND DISCUSSION 


On the assumption that all categories are equally likely to be chosen by any
judge and that every selection is independent of every other selection, the expected
number of correct choices for any particular judge equals the number of selections
divided by the number of categories, or 35/8, which equals 4.375. The following equa-
tion³ gives the probability that (j) or fewer choices will be correct for any given judge:
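
Assuming the equation is the cumulative binomial implied by the symbols defined in footnote 3, that is, $P(X \le j) = \sum_{k=0}^{j} \binom{n}{k} p^{k} q^{n-k}$ with n = 35, p = 1/8 and q = 7/8, the chance probabilities can be sketched in Python as follows. The function names are illustrative and do not come from the original paper.

```python
from math import comb


def prob_at_most(j: int, n: int = 35, p: float = 1 / 8) -> float:
    """Cumulative binomial probability of j or fewer correct choices by chance."""
    q = 1.0 - p
    return sum(comb(n, k) * p**k * q ** (n - k) for k in range(j + 1))


def prob_at_least(j: int, n: int = 35, p: float = 1 / 8) -> float:
    """Probability of j or more correct choices by chance (upper tail)."""
    return 1.0 - prob_at_most(j - 1, n, p)


expected_correct = 35 / 8  # 4.375, as stated in the text

for threshold in (8, 9, 10):
    print(threshold, round(prob_at_least(threshold), 4))
```

The upper-tail values for 9 and 10 correct choices can then be compared with the significance levels the authors report.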
Using this formula it was estimated that 9 or more correct choices are to be consider-
ed significant at the 3 per cent level of confidence and 10 or more significant at the
1 per cent level of confidence. It was also determined that the probability of no one
agreeing with a given judge’s choice is $P = (7/8)^{14}$, or .15, and further, that the
probability that no one of the 15 judges will make the correct choice is
$P = (7/8)^{15}$, or .13.
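
These two probabilities follow from treating each judge's selection as an independent one-in-eight guess, so that a single judge misses a given category with probability 7/8. A minimal Python check under that assumption (variable names are illustrative):

```python
p_miss = 7 / 8  # chance that one judge does not pick a given category

# None of the other 14 judges agrees with a given judge's choice.
p_no_agreement = p_miss**14

# None of the 15 judges picks the correct category.
p_no_correct = p_miss**15

print(f"{p_no_agreement:.3f} {p_no_correct:.3f}")
```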

Table 1 gives a comparison of the selections of the experienced and inexper- 
ienced judges, as well as the results of a hypothetical judge X, whose selections were 
made solely on the basis of the highest abnormal T scale score. Those profiles which 
contained no T scale score above 70 were placed in the normal category. In this in-
stance no attention was paid to Q, L, F, or K scores. When the number of correct
selections for the experienced and inexperienced judges was compared, a significant
difference was found in favor of the experienced judges (χ² = 8.08, p < .01), even
though four of the six judges comprising the inexperienced group exceeded the .01
level of confidence in their accuracy of prediction. At the same time it should be
stated that not even the most accurate judge, who had 19 correct selections, succeeded
in making a significantly greater number of correct selections than could be done
simply by sorting on the basis of the highest abnormal T scale score, which yielded
15 correct selections (χ² = .914, p > .30). In terms of clinical versus statistical
prediction “’, these results appear to support a statistical approach for this task.
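
The comparison between the most accurate judge (19 of 35 correct) and hypothetical judge X (15 of 35 correct) can be reproduced as a 2 x 2 chi square. The sketch below assumes the uncorrected Pearson statistic was used, which the text does not state; the function name is illustrative.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi square, without continuity correction, for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row_1, row_2 = a + b, c + d
    col_1, col_2 = a + c, b + d
    if 0 in (row_1, row_2, col_1, col_2):
        raise ValueError("every row and column total must be positive")
    return n * (a * d - b * c) ** 2 / (row_1 * row_2 * col_1 * col_2)


# Rows: best judge, judge X. Columns: correct, incorrect (out of 35 profiles each).
chi2_best_vs_x = chi_square_2x2(19, 16, 15, 20)
print(f"chi square = {chi2_best_vs_x:.3f}")
```

Under this assumption the statistic comes to about .915, in close agreement with the reported .914 and well below the 3.84 required for significance at the .05 level with one degree of freedom.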

The general unreliability of psychiatric diagnosis as a criterion notwithstanding, 
it is somewhat surprising that the judges were not more successful with their se- 
lections, particularly since whatever degree of contamination obtains between pre- 


3In this equation k= any specific number of correct choices, n = 35 which is the total number of 
selections to be made, p = probability of a correct choice on any given selection and q = probability 
of an incorrect choice on any given selection. This equation was derived from a formula presented on 
p. 44 in Edwards, A. L. Experimental Design In Psychological Research. New York: Rinehart, 1950. 
TABLE 1. COMPARISON OF SELECTIONS OF EXPERIENCED AND INEXPERIENCED JUDGES


Experienced Judges f C D ; G L M 
No. correct in 8 categories 12 ‘ 14 10 19 
No. correct in 4 categories é 18 20 16 25 
χ² values for individual judges* 45 2.88 2.10 3.66 : .69 2.20 2.20
No. idiosyncratic selections 0 


Inexperienced Judges H J 
No. correct in 8 categories 5 6 10 
No. correct in 4 categories 21 12 19 
χ² values for individual judges 1.43 2.69 4.77
No. idiosyncratic selections 2 8 4 





Hypothetical Judge X 
No. correct in 8 categories 15 
No. correct in 4 categories 23 
χ² 3.68





*For one D.F. a χ² value of 6.64 is significant at the .01 level of confidence and a χ² of
3.84 is significant at the .05 level of confidence.

dictor and criterion in our sample would be expected to contribute to successful pre- 
diction. The experienced judges as a group were successful in 40 per cent of their 
predictions while the inexperienced judges were 28 per cent successful. Even the 
most successful of the experienced judges was correct in only 54 per cent of his 
selections. This degree of success in predicting psychiatric diagnosis is not high 
enough to warrant using the MMPI for accurate, individual diagnosis which is 
probably a major requirement of any clinical psychological test. Moreover, consid- 
ering the method with which the sample was selected (i.e., the requirement that all 
diagnoses on any patient included in the sample be in perfect agreement) our findings 
probably represent a maximum degree of successful prediction with the MMPI.

In order to determine whether the judges would have been significantly more 
successful had they been required to sort into only four rather than eight categories, 
the judges’ selections were analyzed again. The eight categories were combined into 
four by retaining the normal and character disorder classification and grouping the 
paranoid schizophrenics, unclassified schizophrenics and manic-depressives into a 
psychotic category and the anxiety neurotics, depressive neurotics and hypochond- 
riacal neurotics into a neurotic category. None of the Chi square values for any of the 
individual judges was significant at the .01 level of confidence. This apparently 
demonstrates that no individual judge was able to make the neurotic-psychotic diff- 
erentiation significantly more accurately than he could the finer differentiations. For 
the judges as a whole, however, as well as for both the experienced and inexperienced 
groups of judges, all three Chi square values were significant beyond the .01 level of 
confidence. It was also found that the experienced judges as a group were significant-
ly more successful in making the neurotic-psychotic differentiation than the in-
experienced group (χ² = 5.91, p < .05).

One other comparison between the two groups of judges deserves mention. 
Table 1 presents the number of idiosyncratic selections for each judge; i.e., the num-
ber of times that a particular judge made a selection which was not shared by any
of the other 14 judges. The difference between the two groups was found to be sig-
nificant beyond the .001 level of confidence (χ² = 36.31), which indicates that the
experienced judges tended to interpret the profiles in a more homogeneous manner
than the inexperienced group.


SUMMARY AND CONCLUSIONS 


This study was concerned with the following problems: Can MMPI profiles be 
used to predict psychiatric diagnosis? If so, is clinical training and experience with 
the MMPI a necessity in such prediction? Are experienced clinicians able to make
a larger number of correct predictions than could be made on the basis of a simplified 
approach to the problem? And further, do inexperienced judges show greater var- 
iability and more disagreement in their predictions than a group of more experienced 
judges? Within the limits of the study it was found that all 9 of the experienced 
judges and 4 of the 6 inexperienced judges were able to predict psychiatric diagnosis 
at the .01 level of confidence or better. Nevertheless, when comparing the perform- 
ances of both groups as a whole, it was found there was a significant difference in 
favor of the experienced group in its ability to predict more accurately and also to 
make significantly fewer idiosyncratic selections. These results notwithstanding, it 
was also determined that none of the judges succeeded in predicting more accurately 
than could be done simply by sorting the profiles according to the highest abnormal 
T scale score. The results also raise the question as to the legitimacy of using the 
MMPI for accurate individual diagnostic prediction in clinical consultations. This 
study serves to emphasize the pressing need for obtaining validity estimates for all of 
the clinical psychological tests included in the clinician’s tool kit. 
GROUP DIFFERENCES IN COLOR CHOICE AND REJECTION
TARMO A. PASTO AND PAUL KIVISTO 
Sacramento (Calif.) State College Atascadero (Calif.) State Hospital 


INTRODUCTION 

Interpretation of Rorschach protocols has leaned heavily on the diagnostic 
meaning of color. The way in which color was used, and the particular color area
chosen, have played a considerable role in personality diagnosis. Jacobi® associated
blue with thinking, yellow with intuition, red with surging and tearing passions, and 
green with growth and sensation. Rorschach? believed that those people who kept 
strong control over their emotions showed a preference for the blue and green figures 
and avoided the red. Birren“? suggested that manics preferred red; hysterics, green; 
schizophrenics, yellow; and paranoids preferred brown. In another article,®) he 
linked yellow with feeble-mindedness. For an excellent review and evaluation of the 
literature, see Norman and Scott“). This study was undertaken to gather norma-
tive data concerning group differences on color preferences, using color charts and
Rorschach card X. 


METHOD AND PROCEDURE 
Two phases characterized the inquiry. The first phase consisted of presenting 
card X of the Rorschach to the subject, asking him to name the color he found most 
pleasing and attractive, and to name the color he found least pleasing or attractive. 
In the first phase, 120 subjects were tested: 40 normals, 40 psychotics, 40 mental 
defectives, divided evenly in each group as to sex. In the second phase, 120 entirely
new subjects in the same clinical groupings were tested in the same manner with a 
color chart. The normals were taken from the hospital personnel. The psychotics 
were taken from the disturbed wards, with the added feature that the second group 
of male psychotics was taken from the tuberculosis wards, and all were considered as 
advanced, irreversible schizophrenic conditions, the majority being actively delu- 
sional and diagnosed paranoid or undifferentiated. The catatonic and simple types 
were excluded from the inquiry because of their inability to respond to the cards and 
charts. The mental defectives were tested on their respective wards. After the data 
were collected, percentage response within each group to the various colors was cal- 
culated. 
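
The percentage calculation described above can be sketched in Python as follows. The list of choices is hypothetical and does not reproduce any group's actual responses; with 20 subjects per group, each choice contributes 5 percentage points.

```python
from collections import Counter


def percentage_response(choices):
    """Percentage of a group's subjects naming each color."""
    if not choices:
        raise ValueError("the group must contain at least one response")
    counts = Counter(choices)
    group_size = len(choices)
    return {color: 100 * count / group_size for color, count in counts.items()}


# Hypothetical preferred-color responses for one group of 20 subjects.
preferred = ["blue"] * 10 + ["red"] * 6 + ["yellow"] * 3 + ["green"]

percentages = percentage_response(preferred)
for color, pct in sorted(percentages.items(), key=lambda item: -item[1]):
    print(f"{color}: {pct:.0f}%")
```

The same function would be applied separately to the rejected-color responses of each group.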


RESULTS 


Examination of Tables 1 and 2 indicates that blue was the most popular color 
among the three clinical groups. Red came next being preferred, on the color chart, 
by the male mental defectives. Yellow rated high for the female normals on the
Rorschach; however, not one normal female voted for yellow in the color chart.
Yellow was rejected more than preferred by the mental defectives, and was of lower 
incidence than red and blue for the same group. The psychotic males and females 
showed a greater range of color choice and rejection, oftentimes giving symbolic 
meanings to their particular selections. Green tended to be rejected by the male de- 
fectives, and had some possibilities as a neurotic indicator when rejected. The prefer- 
ence for gray on the Rorschach also had possibilities as an indicator; yet, on the color 
chart, gray, when chosen by normals, was associated with clothing color and not 
with mood. There was no adequate brown on the Rorschach, but its presence on the 
chart was significantly rejected by the normal males. It tended to be rejected by the 
other groups, also. Gray was significantly rejected in both tables and in all groups. 

When the inquiry was continued later, over a two-year period in a clinic setting,
53 men and 108 women past the age of 50 tended to choose the color red on the
Rorschach, a finding that was hard to explain. Their clinical composition consisted
of ambulatory schizophrenics, severe psychoneurotic conditions, and chronic brain
syndromes. Other findings included the tendency of organics, hysterics and
hypomanics to choose red, or the “extensive” portion of Rorschach’s differential
scale. The choice of yellow appeared in infantile and schizoid types; its denial or
rejection pointed to a denial mechanism at work.


TABLE 1. COLOR PREFERENCE (C+) AND REJECTION (C-)
IN RORSCHACH EXPRESSED AS PERCENTAGES





          Normal    Normal    Psychotic  Psychotic  M.D.      M.D.
          Males     Females   Males      Females    Males     Females
          N = 20    N = 20    N = 20     N = 20     N = 20    N = 20


Color     C+  C-    C+  C-    C+  C-     C+  C-     C+  C-    C+  C-


30 15 20 35 


Blue ID 50 10 


Yellow 5 | 


Gray : : 0 30 
Green 5 10 10 


Olive 
Orange 


White 
TABLE 2. COLOR PREFERENCE (C+) AND REJECTION (C-) IN COLOR CHART EXPRESSED AS
PERCENTAGES


          Normal    Normal    Psychotic  Psychotic  M.D.      M.D.
          Males     Females   Males      Females    Males     Females
          N = 20    N = 20    N = 20     N = 20     N = 20    N = 20


Color     C+  C-    C+  C-    C+  C-     C+  C-     C+  C-    C+  C-


20 10 25 
Blue 50 


Yellow 5 
Gray 10 
Green 15
( aie 0 


Brown 0 5 2! : 25 f { ;} 20 | 





Side remarks were illuminating, and consisted of (1) complaints as to the rela- 
tivity of color choice, (2) symbolisms, (3) disgust, such as “dirty purple,” “nigger- 
pink,” “bile-green,’’ (4) elation, such as “‘I like all colors”... ‘‘pretty’ ... “delight- 
ful, ’’ (5) suspicion, such as ‘“‘What’s behind all this?’’, including the sharp side 
glances, (6) evasion, such as “I haven’t any preferences” ... “They’re all the same
to me.” 


SUMMARY AND CONCLUSION 


Two hundred forty subjects were tested as to color preference on Rorschach 
card X and a color chart. The subjects were grouped into (1) male and female, 
(2) normal, (3) psychotics, (4) mental defectives. Blue and red were the popular 
choices among all the groups; gray and brown were the colors most rejected. Normal
women shifted their preferences more than normal men. The psychotics were char- 
acterized by a greater range of color choice and rejection. The mental defectives re- 
jected yellow more than the other groups. 
PATTERN ANALYSIS OF THE RORSCHACH
ROBERT D. WIRT 


University of Minnesota 


PROBLEM 


This is a report of an investigation of differences in patterning of response tend- 
ency for Rorschach cards and for Rorschach determinants among groups of normal 
and abnormal persons. Interpretation of personality by Rorschach’s technique 
would suggest that differences should exist among psychiatric and normal groups in 
determinant psychograms, but that the stimulus quality of the cards themselves 
should not differentially affect responsiveness among groups. 


METHOD


Three groups of subjects selected from an earlier study were administered the
Rorschach by the Klopfer method. Group I consisted of 32 normal male subjects
living in the community who never had had psychiatric treatment. Group II in- 
cluded 32 neurotic patients; and 32 patients diagnosed schizophrenic constituted 
Group III. All subjects in Groups II and III were hospitalized male veterans and 
the Rorschach was not used in arriving at their diagnoses. An effort was made to 
match each group against the 1950 census for age, occupational level, and education. 
The final samples selected were somewhat better educated than the population as a 
whole and the neurotic group was more heavily weighted with younger age levels 
than the others. 


RESULTS 


Analysis of covariance and chi-square tests of differences in responsiveness 
among age, educational and socio-economic groups failed to produce statistically 
significant results. 

Pattern analyses were made using the procedure developed by Block, Levine,
and McNemar. Since this technique requires normal distributions it was necessary
to combine determinant scores after Cass and to convert the raw scores into
normal scores by McCall’s T-score transformation. Figures 1 and 2 picture the
profiles obtained for Rorschach determinants and the Rorschach cards respectively.
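
The normalizing step can be illustrated with a minimal sketch. It assumes the usual construction of McCall's T-score: each raw score is given a mid-rank percentile, the percentile is converted to a normal deviate, and the deviate is rescaled to a mean of 50 and a standard deviation of 10. The function name and the sample scores are hypothetical and are not data from this study.

```python
from statistics import NormalDist


def mccall_t_scores(raw_scores):
    """Convert raw scores to normalized T-scores (mean 50, SD 10).

    Assumed construction: mid-rank percentile -> normal deviate -> 50 + 10z.
    """
    n = len(raw_scores)
    standard_normal = NormalDist()  # mean 0, SD 1
    t_scores = []
    for score in raw_scores:
        below = sum(1 for other in raw_scores if other < score)
        tied = sum(1 for other in raw_scores if other == score)
        # Mid-rank proportion; always strictly between 0 and 1.
        proportion = (below + 0.5 * tied) / n
        z = standard_normal.inv_cdf(proportion)
        t_scores.append(50 + 10 * z)
    return t_scores


# Hypothetical raw determinant scores for three subjects.
example_t = mccall_t_scores([3, 1, 2])
print([round(t, 1) for t in example_t])
```

The middle raw score falls at the 50th percentile and so receives a T-score of 50; the other two scores are placed symmetrically above and below it.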

Table 1 is the variance table for the two profiles. The F test for the card an- 
alysis (Figure 2) gives a ratio of 0.544, which for 18 and 837 degrees of freedom falls 
far short of an acceptable level of significance. However, the ratio for the determin- 
ant pattern (Figure 1) is 7.41. This ratio, for 10 and 465 degrees of freedom is highly 
significant with a probability of less than .001. In some instances the configurations 
obtained would be predicted by Rorschach theory. But the relative patterns are not 
wholly those which might be expected. For example, the normal group, rather than 
the neurotic group, scored highest on k + K + FK although it would be reasonable 
to predict that these indicators of anxiety would show up most clearly in a neurotic 
population. 
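
The two F ratios quoted above can be checked with a short sketch. It assumes, as the stated degrees of freedom suggest, that each ratio is the variable-by-group interaction variance estimate divided by the residual variance estimate; the sums of squares and degrees of freedom are those given in Table 1, and the function names are illustrative only.

```python
def variance_estimate(sum_of_squares, degrees_of_freedom):
    """Variance estimate (mean square) for one source of variation."""
    return sum_of_squares / degrees_of_freedom


def interaction_f(ss_interaction, df_interaction, ss_residual, df_residual):
    """F ratio of the interaction variance estimate to the residual estimate."""
    return (variance_estimate(ss_interaction, df_interaction)
            / variance_estimate(ss_residual, df_residual))


# Rorschach cards: interaction on 18 d.f., residual on 837 d.f.
f_cards = interaction_f(486.505, 18, 41582.640, 837)

# Rorschach determinants: interaction on 10 d.f., residual on 465 d.f.
f_determinants = interaction_f(2076.360, 10, 13026.392, 465)

print(round(f_cards, 3), round(f_determinants, 2))  # prints 0.544 7.41
```

Both values agree with the ratios reported in the text.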


TABLE 1. VARIANCE TABLE FOR RORSCHACH CARDS AND DETERMINANTS


                        Sum of Squares              d. f.           Variance Estimate
Source                 Cards    Determinants   Cards  Determinants   Cards   Determinants

Variable             4115.255    20336.083        9        5        457.251    4067.216
Groups               1625.685     2495.775        2        2        812.829    1247.887
Individuals         33282.266    15323.765       93       93        357.949     164.771
Variable by Group
  Interaction         486.505     2076.360       18       10         27.0225    207.636
Residual            41582.640    13026.392      837      465                     28.013
Total               81082.224    53258.375               576


SUMMARY AND CONCLUSIONS 


Pattern analyses of Rorschach cards and determinants show differences in the 
determinant patterns, but no differences in card patterns for groups of normal, 
neurotic, and psychotic persons. These findings are consistent with Rorschach expectations;
however, the configurations of determinant patterns are not, in some
instances, what would be predicted. The methodology used here might be of consid- 
erable help in evaluating many hypotheses having currency in clinical practice and 
Rorschach theory where a configural approach to interpretation is required. 
A MEASURE OF SEVERITY OF ILLNESS
MAURICE LORR, JAMES Q. HOLSOPPLE, AND ELIZABETH TURK 


Veterans Administration, Veterans Benefits Office, Washington, D. C. 


PROBLEM 


A typical clinical problem in the initial evaluation of a mentally ill patient is the 
determination of severity of illness. Is the patient’s disturbance or disorder mild, 
moderate, or severe? At a later stage the problem becomes one of assessing the
changes resulting from treatment. Has the patient become better or worse, or have 
no changes occurred? A related question concerns the prevailing determinants clin- 
icians use in judging severity of illness, knowingly or unknowingly, and regardless of 
theoretical bias or persuasion. Is the concept of severity unidimensional and des- 
cribable in terms of one factor, or is it multidimensional? 

Previous research concerned with the development of objectified rating scales 
had indicated that clinicians could make complex judgments at acceptable levels of 
interjudge reliability. Furthermore, these judgments could be used to explore and 
identify the dimensions of mental illness®. It was from this base that the investiga- 
tors set as their task the development of a measure of severity of mental illness de- 
rived from clinical judgments based on interview observations and inferences. 


INITIAL PROCEDURES 


Psychotherapists in an east coast Veterans Administration mental hygiene clinic 
(Clinic A) were asked to identify any male patients undergoing psychotherapy whom 
they would regard as among either the least ill or as among the most severely ill of 
patients seen in the clinic. Severity of illness was to be estimated in terms of custom- 
ary criteria such as the degree of conflict, maladjustment or personality disintegra- 
tion. The acuteness, chronicity or prognosis of the illness was not to be considered. 
Ten mildly ill and 10 severely ill patients were identified in this manner. The ther- 
apists (psychologists, psychiatrists, and social workers) then described each patient 
on the Multidimensional Scale for Rating Psychiatric Patients, Outpatient Form 
(MSRPP). The Multidimensional Scale, elsewhere reported, consists of 49 brief
graphic scales descriptive of patient behavior, symptoms, and inferred attitudes and 
needs. It has been found to measure ten relatively independent factors. 

The distributions of ratings of the two groups on each of the scales and the 
group profiles on the factors were carefully examined. By means of a configural ap- 
proach 14 scales were selected, which in combination completely separated the 
patients designated mildly ill from those designated severely ill. To check these 
findings another east coast clinic, Clinic B, was asked to identify and rate a group of 
mildly ill and a group of severely ill patients in their treatment program. The nine 
mildly ill patients and the nine severely ill patients were rated and it was found that 
the same 14 scale items provided a complete separation. It was clear from these 
findings that clinicians widely separated geographically and differing as to clinical 
experience and training could agree in their descriptions of patients who are classified 
as mildly or severely ill. 


THE CROSS-VALIDATION


The problem next set was to determine whether the same scales would differen- 
tiate equally well if the raters first described their patients and then judged the sever- 
ity of illness. The raters of the initial group of patients knew, of course, whether they 
had classified a patient as mildly ill or as severely ill. There was the possibility that 
this knowledge affected their descriptive ratings. The more usual procedure for valid- 
ating a set of variables is to collect independently experimental data on predictor 
variables and experimental data on the criterion variable. In this study the judg- 
ment of severity is the criterion and the descriptive scales are the associated pre- 
dictor variables. 

To eliminate the effect of knowledge of the patients’ severity classification on 
the raters’ descriptions, data on 49 patients were secured from two additional clinics. 
One clinic is located in the middle west and one is located in the Rocky Mountain 
region. Each psychotherapist was asked to describe several of his patients on the 
MSRPP. After the completed forms had been collected from all therapists, each 
rater was asked to categorize his patient as to severity of illness. Of 100 consecutive 
cases last seen in the clinic, would the patient rank among the 15 mildest cases, the 
15 severest cases, or among the 35 cases in either of two intermediate groups? As 
before, severity was defined in terms of degree of conflict, maladjustment, or person- 
ality disintegration; acuteness or chronicity was not to be represented in the judg- 
ment. 

When the four-category severity of illness ratings of the 49 patients were cor- 
related with their total scores on the 14 scales previously identified as discriminating, 


TABLE 1. BISERIAL CORRELATIONS BETWEEN SEVERITY OF ILLNESS AND 14
RATING SCALES FOR INITIAL AND CROSS-VALIDATION GROUPS








Initial Sample Cross-validation 
(N = 38) Sample (N = 49) 





Lacks impulse control |; 3 | : .40 
Marked reality distortion | i : .73 
Deeply depressed ) : 12 
Self-preoccupied 8 .65 
Relatively anxious A 15 
Low achievement motivation | 2 “f 21 
Slow to modify behavior i 3 9% 44 
Relatively hostile < ; 48 
World perceived as hostile 2 | 8 54 
Resentful of authority | 35 8 31 
Anxious over hostile impulses 5 .O1 
Never considers tomorrow ¢ 5 .43 
Painfully self-conscious : 10 
Ideas of reference | ‘ 26 
a product moment correlation of .70 was obtained, significant above the .001 level.
The drop in the validity coefficient from complete separation to a correlation of .70 
is not difficult to account for. The original group of 38 represented the extremes of 
the population under treatment. On the other hand, the typical case in the cross- 
validation sample was rated as ‘“‘mildly ill’’. Utilization of cases in the middle range 
of severity accounts for much of the attenuation in the correlation coefficient. 
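
The attenuation argument can be illustrated with a small sketch. The severity categories and total key scores below are hypothetical values chosen only for illustration, not the study's data, and the function name is an assumption. The sketch computes the product-moment correlation once for the extreme categories alone and once with the middle categories included.

```python
from math import sqrt


def product_moment_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: four-category severity rating and total score on the key.
severity = [1, 1, 2, 2, 3, 3, 4, 4]
key_total = [2, 4, 6, 3, 5, 8, 7, 9]

# All cases, including the two intermediate categories.
r_all = product_moment_r(severity, key_total)

# Extreme cases only (mildest and severest categories).
extremes = [(s, t) for s, t in zip(severity, key_total) if s in (1, 4)]
r_extremes = product_moment_r([s for s, _ in extremes],
                              [t for _, t in extremes])

print(round(r_all, 2), round(r_extremes, 2))
```

With these illustrative values the coefficient is lower when the intermediate cases are included, which is the direction of the drop described above.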

Table 1 presents the biserial correlations between scales and severity of illness 
for the total initial and total cross-validation groups. Half of the scales drop below 
the 5 per cent level of significance. However, because a configural approach was used 
in selecting the scores the total validity is not seriously impaired. The 14 scales 
showing the highest biserial correlations with severity of illness rating for the cross- 
validation group were identified. When the cross-validation group was rescored on 
these 14 scales the product moment correlation obtained was .68. When the initial 
validation sample of 38 patients was rescored with this key, a biserial coefficient of 
.93 was obtained. The difference between these two coefficients is most certainly due 
to the presence of more extreme members in the initial group. The scales not in- 
cluded in the initially developed key were lack of concern for others, frequent mood 
swings, tendency to dramatize self, suspicion, marked emotional over-reaction to 
situations, overt expression of hostility. Eight scales are common to the two keys. 

A factor analysis of the Multidimensional Scales“ has resulted in the identifica- 
tion of ten personality and symptom factors. The most prominent factor from the 
standpoint of total variance accounted for and number of items is a parameter des- 
criptive of degree of reality distortion and disturbance of interpersonal relations 
(factor B). This factor is defined by distortions in thinking and perception, tendencies 
toward suspiciousness, isolation from others, depressive mood, feelings of depersonal- 
ization, ideas of reference, overt expression of hostility, self preoccupation, defiance 
of authority, and rigidity. The reality distortion factor is represented by six scales 
in the initial key. 

The second factor, E, represented in the severity key is a measure of the degree 
to which the individual assumes control of and responsibility for his own conduct 
in relation to the future. In the severity index it is defined by a relative lack of con- 
cern about the future, a lack of motivation towards long term goals such as advance- 
ment, acceptance, recognition, power, and an inability to control his immediate im- 
pulses, needs or drives in favor of more remote or future satisfactions. 

The clinician’s judgment of severity of illness, at least as defined in these four 
outpatient adult clinics, is thus largely determined by (a) the degree to which the 
patient distorts in his perception and thinking of the world around him; and (b) the 
extent to which he exhibits lack of control of impulses in favor of remote goals or 
satisfactions. 


SUMMARY AND CONCLUSIONS 


The purpose of this study was to develop a measure of severity of mental illness 
derived from clinical judgments based on interview observations. Fourteen rating 
scales were identified which in combination completely separated two groups of 
patients judged to be the mildest and the most severely ill cases in the clinic. Essen- 
tially the same scales were identified when raters first described their patients and 
then judged the severity of illness. 

It is concluded that (1) a relatively small number of descriptive scales may be 
used by clinicians with considerable confidence as an index of severity of mental ill- 
ness; and (2) the concept of severity is multidimensional and describable in terms 
of at least two factors. 
THE REVISED HOOPER VISUAL ORGANIZATION TEST AS A MEASURE
OF BRAIN DAMAGE¹
ROBERT G. WALKER 
VA Hospital, Brockton, Massachusetts 


PROBLEM 

A new psychological test for detecting brain damage, the Hooper Visual Organization
Test, has recently been reported by Hooper. The test consists of 30 drawings
of common, everyday objects such as a table, saw, sailboat, etc., each of which
is divided into two or more sections. The drawings are analogous to the presentation
set-up of Object Assembly in the Wechsler-Bellevue except that S must recognize
what the object is by visual processes alone and is unable to manipulate the parts
of the whole motorically. Following the hypothesis that vocabulary is relatively
unaffected by brain damage while various visual-motor functions are affected, the
test as standardized by Hooper was given with the Shipley-Hartford Vocabulary
Test as a measure of vocabulary level.

The vocabulary raw score is converted to a Hooper expected score and is com- 
pared with the actual score obtained on the Hooper (the number of items correctly 
identified) yielding a D score (difference score). The greater the negative difference, 
i.e., the lower the Hooper performance in respect to vocabulary level, the greater the 
probability of organic brain damage. The test was standardized on patients with 
known brain damage and a control group of junior college students. The purpose 
of the present study was to obtain validity material on a new group, including patients
with less extensive or less well-defined brain damage. This was felt desirable
inasmuch as the psychologist is rarely called on to examine patients with known 
brain damage, but is frequently asked to evaluate patients who have a clinical his- 
tory or presenting symptom picture that is suggestive of cortical involvement. 
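
The scoring rule just described can be sketched as follows. The conversion from vocabulary raw score to Hooper expected score is not reproduced in this paper, so the expected score is simply passed in; the function name and the example values are hypothetical.

```python
def hooper_d_score(actual_score, expected_score):
    """Difference score: actual Hooper score minus the score expected
    from vocabulary level. Negative values mean the Hooper performance
    falls below what the vocabulary level would predict."""
    return actual_score - expected_score


# Hypothetical patient: 18 of the 30 items identified, expected score of 24.
d = hooper_d_score(actual_score=18, expected_score=24)
print(d)  # prints -6
```

On this convention a larger negative D corresponds to a greater probability of organic brain damage.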


PROCEDURE 

The Hooper Visual Organization Test records of 38 neuropsychiatric patients 
were selected from the files of the Clinical Psychology Service of the Boston Veterans 
Administration Hospital. They were scored in accordance with the procedure previously
described. A final discharge diagnosis had been made on 90% of the patients
while the remaining 10% were all considered continued treatment cases with diagnosis
well established.

The chief neurologist² categorized the patients into two basic groups: those who
definitely were felt to be free of any cortical involvement and those who probably 
or possibly were afflicted with brain damage. This classification was based on all the 
material available in the clinical records with the single exception of the psychologi- 
cal report. Most of these patients had received as a minimum a routine neurological 
examination and a routine EEG. On others more extensive studies such as pneumoencephalogram,
special sleep or metrazol EEGs, etc., had been carried out. The
neurologist felt that on the whole the 38 patients so selected were representative of 
routine admissions to the Neurological Service. 

The cases were rated as no cortical involvement (I), definite cortical involvement
(II), and cortical involvement suspected (III). Because of the small N, subgroups
II and III were combined into one single group labeled “cortical involvement
suspected” and compared by Chi Square with subgroup I labeled “control”. No
significant difference was found (Chi Square .38).

In a personal communication Dr. Hooper had mentioned that certain qualitative
features were helpful in addition to the quantitative score. These signs were
called “concrete” responses and consisted of a response to a single part of the drawing
without regard for the remaining parts. For example, Item #24 is a drawing of
a mouse. That part of the drawing depicting the rear quarters and tail has a fairly
close resemblance to a pipe used for smoking. If S replies “pipe” to the item he


¹This study was effected at the Veterans Administration Hospital, Boston, Massachusetts.
²The author wishes to express his very great appreciation to Dr. Fred Quadfassel for his cooperation
in this phase of the study.
demonstrates a loss of set, and inability to integrate the disparate parts of the
whole. Such a response is classified “concrete”. These reactions to the items were
established by a panel of three judges* and the coded records were scored in terms
of number of “concrete” responses.


RESULTS 

A frequency distribution of the scores revealed a distinctly bimodal distribution
with approximately half of the group giving two or more “concrete” responses and
half of the group one or less. A comparison between diagnosis (that is, “brain damage
suspected” versus the “control” group) and presence of “concretes” (two or
more versus one or less) yielded a Chi Square with Yates Correction for Continuity
of 6.60, significant at the 2% level of confidence. Variables which were considered
possible factors in the production of “concrete” responses, such as age and intelligence,
were found to be insignificantly related as evidenced by a Chi Square of .22
between age and “concretes”, and a Chi Square of 1.36 between Shipley-Hartford
vocabulary level and “concretes”. Both age and number of “concretes” were treated
the same. That is, scores falling above the median were considered “deviant”. Vocabulary
scores, on the other hand, falling below the median were categorized
“deviant”.

The results indicate that the Hooper test, as revised, significantly differentiates 
between the two groups sampled, but the very low cut-off point (more than one 
“concrete” suggests cortical involvement) renders the use of the test in an individual 
diagnostic setting highly vulnerable to errors in administration and scoring. It 
would appear, however, that this approach to scoring makes the Hooper test a more 
sensitive instrument and suggests that further revision, for example, increasing the 
number of items, might be very worthwhile. 
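
The kind of fourfold comparison reported in this section can be sketched as follows. The cell counts are hypothetical, since the paper does not give the fourfold table itself, and the function name is illustrative. The formula is the standard Yates-corrected Chi Square for a 2 x 2 table, and 3.841 is the tabled 5% critical value for one degree of freedom.

```python
def yates_chi_square(a, b, c, d):
    """Chi Square with Yates correction for a 2 x 2 table

        a  b
        c  d

    All four marginal totals must be greater than zero.
    """
    n = a + b + c + d
    row_1, row_2 = a + b, c + d
    col_1, col_2 = a + c, b + d
    # Continuity correction: shrink |ad - bc| by n/2, but not below zero.
    corrected = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * corrected ** 2 / (row_1 * row_2 * col_1 * col_2)


# Hypothetical counts (NOT the study's data):
# rows = suspected cortical involvement / control,
# columns = two or more "concretes" / one or less.
chi_square = yates_chi_square(14, 5, 5, 14)

CRITICAL_5_PERCENT_1_DF = 3.841
print(round(chi_square, 2), chi_square > CRITICAL_5_PERCENT_1_DF)
```

With a cut-off as low as two responses, moving a single case from one cell to another changes the statistic appreciably, which is the vulnerability to administration and scoring errors noted above.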


SUMMARY 


On the basis of clinical evaluation independent of psychological testing, 38 
patients admitted to the Neurological Service of the Boston Veterans Administra- 
tion Hospital who had been administered the Hooper Visual Organization Test were 
divided into two equal groups: a “control” group and a group suspected of cortical
involvement. The Hooper Visual Organization Tests were scored according to the 
original standardization procedure and the results compared with the clinical eval- 
uations. This comparison failed to meet tests of significance. 

The Hooper records were then rescored on the basis of a quantitative analysis 
of a test characteristic previously considered by Hooper a qualitative aspect, namely, 
the presence of “concrete” responses to the test material. A comparison between the
revised Hooper Visual Organization Test scores and the clinical evaluations indicated 
that normals and individuals suspected of cortical involvement could be differen- 
tiated at a statistically significant level. 
PSYCHOLOGICAL FINDINGS IN A CASE OF VON RECKLINGHAUSEN’S
DISEASE AND HYPERPITUITARISM 


THOMAS E. JORDAN 


Ball State Teachers College, Muncie, Indiana 


INTRODUCTION 
Von Recklinghausen’s disease is one of the more interesting physical disorders
encountered in the study of mental deficiency. According to Kugelmass (p. 88) it
is “. . . one of the triad of congenital ectodermatoses with tuberous sclerosis and
cerebral angiomatosis.” Quite frequently there is a “. . . low standard of physical
and mental development in the subjects of this disease.” (p. 8) Hypothyroidism
is present quite frequently as a concomitant condition. The purpose of this paper is
to present a summary of psychological material gathered while working with a boy
diagnosed as having Von Recklinghausen’s disease and hyperpituitarism. It is the
writer’s belief that no extant study of this condition presents enough of the data in
which psychologists are particularly interested.


CASE REPORT


The material reported in this paper comes from observations of the subject,
psychometric techniques, and a physician’s report.


W. M., a white male, the youngest of three boys, was born September 30, 1943, 
and was twelve years and three months old when he came to the attention of the 
writer. At this time he was 68 inches tall and weighed 220 lbs. Because of his unusual
appearance a medical opinion was requested with the following findings: 


“On physical examination his appearance was that of a dull, obese, husky male. A definite 
speech impediment was noted characterized by a deliberate manner indicating some speech 
therapy. His height and weight were far out of normal percentile range. Fundoscopic exam re- 
vealed a decrease in number of retinal vessels and a pallor of the optic nerve discs. His blood 
pressure was 130/90 and his skin revealed multiple pigmented nevi. His genitalia were extremely 
mature showing adult characteristics. 

To summarize my clinical findings: there was mental and motor retardation, precocious de- 
velopment, both sexually and physically, gigantism, elevated blood pressure, and multiple pig- 
mented nevi... hyperpituitarism and Von Recklinghausen’s Disease.” 


The physical appearance of the siblings is also noteworthy. There is a fourteen
year old boy who stands 6’2” and weighs 160 lbs. The oldest boy is 6’2” and weighs
215 lbs. Unlike the subject the siblings apparently are of normal intelligence. Both
do satisfactory school work which, while no guarantee of high intellect, is at least
indicative of a degree of intelligence adequate for most purposes. The parents are
normal in appearance, slight overweight being the only observable physical anomaly.
The father works as an “operator” at a chemical plant, and the mother does part-time
work at a laundry.


Psychometric Report. The following psychological techniques were employed:
The Wechsler Intelligence Scale for Children, The Coloured Progressive Matrices 
(1947), the Rorschach test, The Michigan Picture Test, and observations made 
during the working sessions. An articulation test and a pure-tone audiometric test 
were also administered. 

The results of the Wechsler Intelligence Scale for Children were as follows:

Verbal Scale IQ, 72            Information            Picture Completion
Performance Scale IQ, 61       Comprehension          Picture Arrangement
Full Scale IQ, 64              Arithmetic             Block Design
                               Similarities   4       Object Assembly
                               Vocabulary             Coding


The test scores indicate widely differing levels of ability. In general he operates at
the level described by Wechsler as “mentally retarded.” The performance
items indicated consistently low ability with the general level of performance items
clearly in the range usually allotted to the mentally retarded.

The verbal scale, as a whole, presents a slightly better picture. An IQ of 72 puts
the subject in the borderline category. Comprehension, a subtest which places em- 
phasis on items of day-by-day living, was below average, but not to a gross degree. 
It may be interpreted to mean, when taken by itself, at least an acceptable degree of 
skill in the social side of life, an awareness of the conventions of society. The Inform- 
ation score while not high, suggests an impairment rather than a complete failure to 
grasp everyday items of information. Vocabulary is lower, while Arithmetic and 
Similarities suggest a definite lack of ability for situations requiring those skills. 

Some rather interesting observations were made during this testing period. The 
client was most cooperative, entering into all the tasks with enthusiasm. However, 
he tended to withdraw very quickly from frustrating situations or situations where 
his failure was unmistakable. His tolerance for frustration was not high. His answers 
to questions, complicated by articulation difficulties, were simple and direct. No fur- 
ther questioning was profitable at any time after the client had answered a question 
to his own satisfaction. The results of the WISC suggest that the client is a fairly 
well socialized boy whose lack of ability comes more sharply into focus when the life- 
situation changes from one which emphasizes the veneer of sociability and affability 
to one requiring specific skills and abilities of the kind usually considered indicative
of certain intellectual ability. 

The Coloured Progressive Matrices (1947), a non-verbal “. . . test of fairly complex
reasoning abilities . . .” (p. 4), consists of three series of incomplete designs,
A, Ab, and B. The client’s performance on these culture-free, performance items was
as follows: set A, 8 correct choices; Ab, 5 correct responses; and 5 on set B, for a total
score of 18. This is a score at the minus 2σ level for children of the client’s age. As a
qualitative note his errors were gross, being a failure to grasp the nature of the
problems rather than near misses. The Matrices indicated clearly defective ability
in a situation where reasoning was essential.

The Rorschach plates were administered to gather some information about the 
subject’s perceptual and apperceptual processes. The responses were quite atypical, 
and, in summary, were as follows: 


R = 8                            Init. R.T. (Average)    11.6 sec.
Total time   125 sec.              (8, 9, 10)            10 sec.
T/R   15.6 sec.                    (2, 3, 8, 9, 10)      10 sec.
No. R. 8, 9, 10 / R                (1, 4, 5, 6, 7)       8.8 sec.
No. R. 2, 3, 8, 9, 10 / R
                                 W/M     5:0
                                 M/FM    0:0
W = 60 (%)                       M/C     0:0
D = 40  "                        F%      100
d =  0  "                        F+%
P = 12  "                        A%
Cards rejected, #7 and #10.      H%


The subject’s responses were quite brief and attempts to produce elaborations during 
the inquiry were quite unsuccessful. The rejections were not accompanied by any 
display of emotion; the subject simply replied to the question, “What might this
be?” with “I don’t know,” after studying the card for a few moments. 
The significant characteristics of this record were (1) the small number of R,
(2) the brevity and superficiality of responses, (3) absence of color as a determinant,
(4) stereotypy in response, (5) rejection of plates VII and X, (6) F%, (7) high A%,
and (8) general poverty of content. The poverty of the client’s inner life is apparent.
He operates at an immature level, having comparatively little ability for problem
solving, and in the absence of other indications, his card rejection is felt to be an
escape mechanism from a frustrating situation. (This tendency was also apparent in
his WISC responses.)

The Michigan Picture Test is a set of drawings analogous to the TAT. There are 
differences in rationale, but the test materials are essentially the same, being human 
figures in situations open to numerous explanations. In view of the client’s demonstrated
economy of expression and low tolerance for frustrating situations, the pictures
were selected on the basis of obvious relevance to a boy’s life. The following
cards were selected: #7, #1, #2, #4B, #3, #8B, #11G, and #12. The responses
were what might have been expected in view of the other results. They were
brief, superficial and not of sufficient caliber to justify any judgments about the
client’s interpersonal relations. The only potentially useful data obtained
were the reaction times to the cards. The mean r.t. to all cards was 5.1”, and the
range was from 3” to 8”. Since it is not clear whether the 8” r.t. was the consequence
of the difficulty level of the card or the dynamic significance of the picture, no conclusions
can be drawn about the longest r.t.


Diagnostic Summary. The overall picture of the boy presented by the psycho- 
metric results and supplemented by interviews is that he is a quiet, passive, affable 
boy of limited intelligence. He seemed to cooperate well, and there were no indica- 
tions of overt conflicts. He has a low threshold for frustration which will probably 
interfere with any attempt at education. He is educable, in a minimal sense, and 
while capable of solving simple problems, he probably will not be capable of working 
without fairly close supervision. 

SUMMARY 

The psychological characteristics of a boy diagnosed as having Von Reckling- 
hausen’s disease and hyperpituitarism were presented. The relative value of psy- 
chometric techniques as they contributed to an understanding of the boy is interest- 
ing. The WISC provided an opportunity to assess global intellectual capacity and 
the differing levels of abilities for different tasks. Cues relating to the subject’s re- 
action to frustrating situations appeared. The Matrices gave an opportunity to see 
how the client responded in a problem-solving situation in which verbal abilities 
played no part. Performance there was low, and the errors were gross. The Michigan 
Picture Test failed to provide any useful data, and while not intended to be used as 
the writer used it, it did not appear to be a useful instrument in cases of known or 
suspected mental retardation. Rorschach’s Test, with its emphasis on perceptual 
and apperceptual processes, was most helpful. The dearth of intellectual content, and 
the absence of affective cues were readily apparent. 
USE OF THE MINNESOTA MULTIPHASIC PERSONALITY INVENTORY 
AS AN INDEX TO “ESCAPISM” 


HERBERT S. BEALL AND JAMES H. PANTON 


Reception Center, Central Prison, Raleigh, North Carolina 


PROBLEM 

A preliminary attempt has been made to derive an index of “escapism” to facilitate 
accurate custodial classification and subsequent assignment of male felon inmates 
being processed by the Reception Center, Central Prison, Raleigh, North 
Carolina. In classifying inmates processed through the Reception Center, emphasis 
is placed on determining the degree of custodial restraint necessary to retain the 
inmate during his period of incarceration. Once custody ratings are determined, 
inmates are assigned to units which afford the custodial facilities believed necessary. 
Reliance is placed on the evaluation of social histories and overt attitudes in determining 
the custodial rating an inmate shall have. Sufficient error is indicated in the 
use of these methods to warrant the derivation of a supplemental, more objective, 
and more valid technique. It was hypothesized from observation that the propensity 
for escape derives, at least in part, from a constellation of personality variables 
which, if identified, could serve to differentiate between potential escapists and 
non-escapists. 

PROCEDURE 

The booklet form of the Minnesota Multiphasic Personality Inventory was administered 
to 413 male felons admitted to Central Prison during the period January 
1955 through December 1955; the group included all literate admissions with intelligence 
quotients above eighty. Of this group, 103 had escaped from custody while serving 
previous or current sentences. These inmates served as the “experimental group”. 
One hundred non-escapees were selected at random as the “control group”. At the time of the 
experiment none of this group had records of escape and each had been determined 
a “good” custody risk. Chi-square was employed to determine the difference in the 
frequency with which the two groups responded to each of the five hundred and sixty-six 
“personality variables”. 
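The item-by-item comparison described above can be sketched as a 2 x 2 chi-square test per item. What follows is a minimal illustration, not the authors' actual computation: the response counts are hypothetical, the function name is an assumption, and no continuity correction is applied (the article does not say whether one was used). The critical value 6.635 is the standard chi-square value for one degree of freedom at the 1% level.

```python
def chi_square_2x2(true_a, n_a, true_b, n_b):
    """Pearson chi-square (no continuity correction) for a 2 x 2 table.

    Rows are the two groups; columns are "True" and "False" answers.
    """
    observed = [
        [true_a, n_a - true_a],
        [true_b, n_b - true_b],
    ]
    total = n_a + n_b
    row_totals = [n_a, n_b]
    col_totals = [true_a + true_b, total - (true_a + true_b)]

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected
    return chi2


# Chi-square critical value, 1 degree of freedom, 1% level.
CRITICAL_1_PERCENT_DF1 = 6.635

# Hypothetical item: 70 of 103 escapees and 40 of 100 non-escapees answer "True".
chi2 = chi_square_2x2(70, 103, 40, 100)
differentiates = chi2 > CRITICAL_1_PERCENT_DF1
```

Repeating this test for every inventory item and retaining those that exceed the critical value would yield a set of differentiating items of the kind reported in Table 1.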


RESULTS 
As demonstrated in Table 1, forty-two items were responded to significantly 
differently by the two groups at the 1% level of confidence. Individual scores were 
computed for the members of both groups on the basis of their responses to each of 
the forty-two variables. Arbitrarily, one point was allowed for each variable when 
answered in the direction, either true or false, in which “escapists” differentiated 
themselves from “non-escapists”, as indicated in Table 1. 
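The unit-weight scoring rule just described might be sketched as follows. The scoring key shown is invented for illustration (the actual forty-two items and their keyed directions are those of Table 1), and the function and variable names are assumptions.

```python
# Hypothetical scoring key: item number -> response keyed in the "escapist" direction.
# The real key is given in Table 1 of the article; these three entries are invented.
SCORING_KEY = {12: True, 45: False, 101: True}


def escape_index(responses, key=SCORING_KEY):
    """Count one point for each keyed item answered in the keyed direction.

    `responses` maps item number -> True/False; unanswered items earn no point.
    """
    return sum(1 for item, keyed in key.items() if responses.get(item) == keyed)


# Items 12 and 101 are answered in the keyed direction; item 45 is not;
# item 200 is not part of the key and is ignored.
score = escape_index({12: True, 45: True, 101: True, 200: False})
```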


TABLE 1. CHI SQUARES OF 42 M. M. P. I. ITEMS THAT WERE ANSWERED SUFFICIENTLY DIFFERENT 
AT THE 1% LEVEL OF CONFIDENCE BY 100 NON-ESCAPEES AND 103 ESCAPEES 
The point of greatest dichotomy, established at a score of seventeen 
(Table 2), served to distinguish 76.7% of the escapee group and 73% of the non-escapee 
group. It is noteworthy that 26% of the non-escapee group achieved lower 
scores than the lowest score of any escapee. 
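One way to locate such a cutting score is sketched below. This is an illustration under stated assumptions, not the authors' procedure: it assumes that a score at or above the cutoff is read as "escapist", it takes the best cutoff to be the one that maximizes the sum of the two percentages correctly distinguished, and the score lists are invented.

```python
def percent_distinguished(escapee_scores, non_escapee_scores, cutoff):
    """Percent of each group falling on its expected side of the cutoff.

    Assumption: a score at or above the cutoff is read as "escapist".
    """
    hit_esc = sum(s >= cutoff for s in escapee_scores)
    hit_non = sum(s < cutoff for s in non_escapee_scores)
    return (100.0 * hit_esc / len(escapee_scores),
            100.0 * hit_non / len(non_escapee_scores))


def best_cutoff(escapee_scores, non_escapee_scores, max_score=42):
    """Cutoff that maximizes the summed percentages for the two groups."""
    return max(range(max_score + 1),
               key=lambda c: sum(percent_distinguished(
                   escapee_scores, non_escapee_scores, c)))


# Invented scores for illustration only.
escapees = [15, 17, 18, 19, 20, 21, 22, 24]
non_escapees = [9, 11, 12, 14, 15, 16, 18, 20]
cutoff = best_cutoff(escapees, non_escapees)
```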


TABLE 2. FREQUENCY DISTRIBUTION OF RAW SCORES OF NON-ESCAPEES AND ESCAPEES ON 
42 M. M. P. I. ITEMS 
To insure against drawing conclusions based on chance differences found in 
responses to the variables by the original groups, cross validation was carried out on 
an entirely new sample of 103 “eligibles” admitted to Central Prison during 
January-March, 1956. This sample consisted of fifty-seven inmates who had escaped while 
serving previous sentences and forty-six inmates who had served previously without 
escaping. The results compared favorably with the original findings. The point of greatest 
dichotomy was established at a score of eighteen. At this point 77.2% of the escapees 
and 78.3% of the non-escapees were distinguished. Table 3 gives a comparison of 
the mean scores and standard deviations of the escapee and non-escapee groups of 
the original and validation samples. The means of the escapee and non-escapee 
groups of both samples were significantly different at the 1% level of confidence. 
Application of the Kuder-Richardson Method to determine inter-item consistency 
reveals a reliability coefficient of .76 for the original group of 103 escapists and 100 
non-escapists. 

TABLE 3. COMPARISONS OF MEANS AND STANDARD DEVIATIONS 

Groups                                                      Means    Standard Deviations 

Original Non-Escapee Experimental Sample (N = 100)          0°       4.75 
Remainder of Non-Escapee Population from which Original 
  Experimental Sample was Drawn (N = 207)                   5.37     4.25 
Validation Non-Escapee Sample (N = 46)                      14.7     3.95 
Original Escapee Experimental Sample (N = 103)              19,      4.33 
Validation Escapee Sample (N = 57)                          26:4     4.07 
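The Kuder-Richardson reliability estimate mentioned above can be illustrated with a short sketch. The article does not state which Kuder-Richardson formula was applied; formula 20 is assumed here, the function name is hypothetical, and the response matrix is invented.

```python
def kr20(item_matrix):
    """Kuder-Richardson formula 20 for dichotomous (0/1) item scores.

    `item_matrix` is a list of respondents, each a list of 0/1 item scores.
    """
    n = len(item_matrix)
    k = len(item_matrix[0])
    totals = [sum(person) for person in item_matrix]
    mean_total = sum(totals) / n
    # Population variance of total scores, as in the classical formula.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    sum_pq = 0.0
    for j in range(k):
        p = sum(person[j] for person in item_matrix) / n  # proportion scoring 1
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Invented responses: four respondents, three items.
reliability = kr20([[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]])
```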

A comparison of the mean profiles of the two groups revealed marked similarity, 
indicating that the forty-two variables found to differentiate between escapists and 
non-escapists are derived independently of any single scale or group of scales of the 
MMPI. 

CONCLUSIONS 

1. Propensity for escape among imprisoned felons derives, at least in part, 
from constellations of personality variables. 

2. Forty-two of these variables appear in the Minnesota Multiphasic Personality 
Inventory. 

3. Evaluation of response to forty-two variables will serve as an index to 
escapism and can be used as a supplemental, more objective method of assigning 
custody ratings to new prison admissions. 

4. The forty-two variables used in the index appear to derive independently of 
any single scale or group of scales of the MMPI. 


TWO ANXIETY SCALES CORRELATED AND EXAMINED 
FOR SEX DIFFERENCES 
DANIEL SINICK 
New York Regional Office, Veterans Administration 


Two correlations between the Taylor Manifest Anxiety Scale and the Test 
Anxiety Questionnaire devised by Sarason and Mandler have previously been reported: 
.59 and .44. Although sex differences on the Test Anxiety 
Questionnaire have not been reported, they have been for the Taylor scale. In three 
studies, the mean Taylor scores of female subjects were higher than the 
mean scores of male subjects, but not significantly so. In a fourth study, the 
female means for various groups were all significantly higher than the male means. 


METHOD 

As part of a study¹ reported elsewhere, the Test Anxiety Questionnaire and 
the Taylor scale were administered, in that order, to 211 community college students, 
mostly freshmen, assembled in six sections of an introductory psychology course. 
Fifty subjects were females and 161 males. A single class period sufficed for all the 
subjects to complete both scales. 

The 50 MMPI items selected and revised by Taylor were employed without 
filler items. McCreary and Bendig found a correlation of .95 between a 50-item 
form and a 225-item form containing 175 unscored items. The Test Anxiety Questionnaire 
has 35 scored items and four fillers. 


RESULTS 


The Pearson product-moment correlation between the Taylor scale and the 
Test Anxiety Questionnaire for the population sample described is .43. For males 
alone the r is .45, for females .35. 


¹D. Sinick. The effect of encouragement on the test performance of subjects with varying amounts 
of anxiety. Ph.D. thesis, New York University, 1955. 
The sex differences here reported (see Table 1) agree with previous findings. On 
both the Taylor scale and the Test Anxiety Questionnaire, the female mean was 
significantly higher than the male mean. While the female standard deviation on 
the Taylor scale was greater than that of the males, on the Test Anxiety Question- 
naire the reverse was true. 
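A comparison of two group means of this kind can be sketched from summary statistics alone. The pooled-variance form of Student's t is assumed here, since the article does not specify the form used; the group sizes are those of the study, but the means and standard deviations are invented for illustration.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's t for two independent groups from summary statistics.

    Standard deviations are assumed to be sample SDs (n - 1 denominator).
    Returns the t value and its degrees of freedom.
    """
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, n1 + n2 - 2


# Group sizes are from the article; means and SDs are invented.
t, df = pooled_t(20.0, 8.0, 50, 17.0, 7.0, 161)
```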


TABLE 1. SEX DIFFERENCES ON THE TAYLOR MANIFEST ANXIETY SCALE AND ON THE SARASON- 
MANDLER TEST ANXIETY QUESTIONNAIRE 

                 Females (N = 50)      Males (N = 161) 
Measure          Mean       SD         Mean       SD         t 

Taylor MAS       16.02 
S-M TAQ 

SUMMARY 
An r of .43 is reported between the Taylor Manifest Anxiety Scale and the Test 
Anxiety Questionnaire devised by Sarason and Mandler. The population sample 
consisted of 161 male and 50 female college students. For males alone the r is .45, 
for females .35. On each of the scales, the female mean was significantly higher than 
the male mean. 
FURTHER EXAMINATION OF THE TREE-SCAR-TRAUMA HYPOTHESIS 
B. J. BOLIN¹, ANN SCHNEPS², AND W. E. THORNE 
Central State Hospital, Lakeland, Kentucky 


INTRODUCTION 

One of Buck’s hypotheses concerning his House-Tree-Person technique is 
that tortuous, twisted tree trunks, broken branches, etc., may represent repressed 
or partially repressed traumatic events from the subject’s past. He presents the case 
of a subject who spontaneously drew two scars on the tree. Questioning revealed 
that for this subject the scar nearest the base of the tree symbolized the death of a 
playmate when the subject was four years old. The other scar, farther up on the 
tree, represented the psychic trauma suffered by the subject at age 15 when his 
brother died. The implication is that the position of scars on the tree may relate to 
the time in life when traumata were sustained, with the tree symbolizing life span 
or the passage of the years. 

¹We acknowledge with thanks the assistance of Dorothy Scott in this study. 

²Now of Great Neck, New York. 
Levine and Galanter conducted what appears to be the first published study of Buck’s foregoing 
hypothesis. They obtained tree drawings from 27 amputees, for whom it 
was assumed the amputations represented distinct traumata. Unfortunately, these 
subjects spontaneously produced only seven scars, seriously limiting the significance 
of the study. The authors report a rank-order correlation coefficient of .73 between 
the scar height and the known date of the amputation in the seven cases, with none 
of the scars being located within two years of the age that would be predicted by the 
hypothesis. 

Lyons sought to test the hypothesis by attempting to induce scars. He had 
each of his 50 subjects draw a tree, assume the tree had once been hit by lightning, 
and place an X-mark where the lightning would have struck. He then asked 
the subject, among 19 other questions, “What is the worst single thing that ever 
happened to you in your whole life?” and “When did it happen?” Lyons found a 
product-moment correlation of .54 between the relative height of the “induced scar” 
and the relative age of the worst event, which was significant at the 1% level of 
confidence. 


SUBJECTS AND PROCEDURE 


We applied the major steps of Lyons’s procedure to a group of hospitalized 
patients and to a group of presumably normal persons. Group A consisted of 51 
mental patients, 80 per cent of whom had been diagnosed as psychotic. Diagnostic 
categories were represented as follows: schizophrenia, 33; chronic brain syndrome, 
with psychosis, 3; mental deficiency, with psychosis, 2; psychotic depressive re- 
action, 1; involutional psychosis, 1; psychopathic personality, with psychosis, 1; 
personality disorder, without psychosis, 4; psychoneurosis, 3; mental deficiency, 
with behavioral reaction, 2; and chronic brain syndrome, without psychosis, 1. We 
asked each of these patients to draw a tree. The drawing completed, we asked the 
subject to assume that lightning had once struck the tree, and to place a cross mark 
at the point where he felt the lightning would have struck. Then we asked him to 
recall the worst single thing that had ever happened to him and when it had hap- 
pened. 

Group B consisted of 17 graduate nurses and 14 psychiatric aides in training. 
These subjects, in small classroom groups, drew trees and, instructed as Group A, 
placed an X-mark “where the lightning would have struck.” They indicated the 
time but not the nature of the “worst event.” 

A third group, Group C, consisted of 39 patients diagnosed as follows: schizo- 
phrenia, 33; chronic brain syndrome, with psychosis, 2; mental deficiency, with 
psychosis, 2; manic-depressive psychosis, manic, 1; paranoid condition, 1. We inter- 
viewed these patients individually and instructed each to place an X-mark wherever 
he wished on a prepared (duplicated) tree drawing. At least several days later, we 
again approached each subject and asked him to state when the worst event of his 
life had occurred. 

For each group, we measured in millimeters the trees from their lowest to their 
highest point (HT) and the distance from the lowest point of the tree to the crossing 
of the X-mark (SC). We determined to the nearest year each subject’s present age 
(CA) and his age at the time of the alleged trauma (TR). To determine relative 
heights of “scars” and relative ages, we computed the ratios SC/HT and TR/CA. 
Then we computed for each group the product-moment correlation between the two 
ratios. 
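The computation just described (two ratios per subject, then a product-moment correlation between them) can be sketched briefly. The four subject records below are invented and the helper name is an assumption; only the quantities SC, HT, TR, and CA follow the definitions in the text.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Each tuple is (SC, HT, TR, CA): mark height and tree height in millimeters,
# age at the "worst event" and present age in years. Values are invented.
subjects = [
    (40, 160, 10, 40),
    (90, 180, 15, 30),
    (120, 160, 30, 40),
    (50, 200, 36, 48),
]

relative_heights = [sc / ht for sc, ht, _, _ in subjects]
relative_ages = [tr / ca for _, _, tr, ca in subjects]
r = pearson_r(relative_heights, relative_ages)
```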


RESULTS 


Our findings fail to indicate any significant relationship between the locations of 
the subjects’ “lightning scars” and their (relative) ages at the time of their “worst 
events”. For Group A (patients), the r between SC/HT and TR/CA is .13; for Group 
B (non-patients) it is -.04; and for Group C it is .02. None of these r’s reaches the 
5% level of significance. 
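The significance of correlations of this size can be checked approximately with Fisher's z transformation. This is a swapped-in approximation, not necessarily the test the authors used; the r values and group sizes are those given in the text (Group B being the 17 nurses plus 14 aides), and the function name is an assumption.

```python
import math


def fisher_z_statistic(r, n):
    """Approximate normal deviate for testing r against zero (Fisher's z)."""
    return math.atanh(r) * math.sqrt(n - 3)


# r values and group sizes as reported in the text.
groups = {"A": (0.13, 51), "B": (-0.04, 31), "C": (0.02, 39)}
TWO_TAILED_5_PERCENT = 1.96  # normal deviate for the 5% level, two-tailed

significant = {
    name: abs(fisher_z_statistic(r, n)) > TWO_TAILED_5_PERCENT
    for name, (r, n) in groups.items()
}
```

Under the same approximation, Lyons's r of .54 with 50 subjects exceeds the 1% value of 2.576, in keeping with the significance level quoted for his study.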
In order to determine whether the distributions of X-placings (SC/HT ratios) 
differ among the three groups, we carried out an analysis of variance. The obtained F 
value of 2.83 falls short of significance at the 5% level, an F value of 3.07 being required 
for that level. Thus we cannot reject the hypothesis that these distributions 
are random samples from a common population (of X-placings). Hence it appears 
that there is no difference between hospitalized mental patients and normals, or between 
those patients who were instructed to mark where the lightning would have 
struck the tree and those who were merely instructed to mark an X on the tree, with 
respect to where they tend to mark X’s on the trees. 
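A one-way analysis of variance of the kind reported above can be sketched as follows. The SC/HT values are invented and the function name is an assumption. With the group sizes given in the text (51, 31, and 39), the degrees of freedom would be 2 and 118.

```python
def one_way_f(*groups):
    """F ratio for a one-way analysis of variance, with its degrees of freedom."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    # Between-groups sum of squares: group means against the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-groups sum of squares: each value against its own group mean.
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within), df_between, df_within


# Invented SC/HT ratios for three small groups.
f, df1, df2 = one_way_f([0.2, 0.4, 0.6], [0.3, 0.5, 0.7], [0.4, 0.6, 0.8])
```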

Incidental Findings. Our subjects produced very few spontaneous scars. Features 
that might be taken for scars of some kind appeared on four patients’ and three 
non-patients’ tree trunks. The “worst events” may be of some general interest. They 
fall into classes as follows: being hospitalized, 12; physical injury or illness, 10; becoming 
mentally ill, 7; death of a loved one, 6; clashing with the law, 2; and miscellaneous, 
14. The miscellaneous events range from “Being born” to “Spending the 
night in Grand Canyon.” 

SUMMARY 


Tree drawings were obtained from each of 51 hospitalized mental patients and 
31 non-patients. Each subject was induced to mark the tree “where lightning had 
struck at some time in the past” and to locate chronologically the “worst event” of 
his life. No significant correlation was found between the relative height of the 
“lightning scar” on the tree and the relative age of the “worst event”. These two 
groups’ “scar” placement was not significantly different from 39 psychotic subjects’ 
random placement of X-marks on prepared (duplicated) tree drawings. The “worst 
events” of the 51 patients are presented by category. 
A GOODENOUGH SCALE EVALUATION OF HUMAN FIGURE DRAWINGS 
OF THREE NON-PSYCHOTIC GROUPS OF ADULTS 
MARY MARTHA MURPHY 


State Colony, Woodbine, New Jersey 


INTRODUCTION 

This investigation tests the hypothesis that human figure drawings of both 
maladjusted and normally adjusted adults of average or higher intelligence are so 
graphically immature as to be comparable to those of adult mental defectives as 
scored by Goodenough Draw-A-Man Test norms. Results of previous studies have 
suggested little difference in Goodenough scores of psychotic, non-psychotic and 
mentally deficient adults. Stonesifer reported Goodenough mean point scores of 
approximate borderline defective level for both a schizophrenic group and a control 
group of non-psychotic adults. Gunzburg reported a mean Goodenough IQ of 74 
for a group of non-pathological adult mental defectives. 


METHOD 


The Goodenough Draw-A-Man Test and a formalized intelligence scale were 
administered to a group of hospitalized chronic alcoholics, a group of job applicants 
and a group of institutionalized mental defectives. The alcoholic group consisted of 
30 males who were hospitalized at the Veterans Administration Center in Bath, 
New York when the tests were administered. All had been diagnosed as psycho- 
neurotic with chronic alcoholism as a symptomatic manifestation. The mean age of 
the group was 53 with a range of 29 to 64 years. The average educational level was 
9 grades with a range of 5 to 18 years of formal educational background. Adminis- 
tration of Form I of the Wechsler-Bellevue Intelligence Scale disclosed a mean Full 
Scale IQ of 110 with a range of 92 to 125. Mean Verbal Scale IQ was 111 and mean 
Performance Scale IQ was 108. 

The job applicant group consisted of 40 male applicants for employment at the 
State Colony at Woodbine, New Jersey. All were deemed sufficiently intelligent and 
adjusted to be eligible for job placement by virtue of past employment records and 
satisfactory attainment on employee selection tests. None submitted a history of 
treatment for emotional disorder. Mean age was 38 with a range of 21 to 56 years. 
The average educational level was 10 grades with a range of 6 to 14 years of formal 
educational background. Administration of the Wechsler-Bellevue I Vocabulary 
Scale disclosed a mean Weighted Score of 10 which is the average verbal subtest 
score for Wechsler’s 20-29 year age groups. Weighted Vocabulary Scores ranged 
from 8 to 14 which suggested that verbal intelligence of this group was distributed 
between low average and superior levels. 

The mentally deficient group consisted of 30 male working patients at the 
Woodbine Colony who had been selected for job placement on the basis of satis- 
factory records of work and social adjustment. Ages ranged from 20 to 54 years with 
a mean chronological age of 30. Educational backgrounds had been attained in the 
ungraded state colony schools with programs generally adjusted to individual 
capacities. None had histories of organic brain disease or injury. Form L of the 
Stanford-Binet Intelligence Scale disclosed a mean IQ of 62 with a range of 33 to 80. 


RESULTS 
No statistically significant differences were disclosed in average Goodenough 
scores of the three groups. Table 1 reveals little difference between the borderline 
defective mean performance levels of the alcoholic and job applicant groups and the 
mean high grade defective performance level of the group of mental defectives. The 
relatively high correlation (.78) between drawing scores and Stanford-Binet results 
for the mentally deficient group suggests that the Draw-A-Man Test may serve as an 
adequate measure of the general intellectual ability of adjusted mentally deficient 
adults. The correlation coefficients of Goodenough IQ’s with Wechsler Verbal Scale 
and Performance Scale IQ’s were .31 and .24 respectively for the alcoholic group, 
and there was a near zero correlation coefficient (.04) between drawing scores and 
the weighted scores of the Wechsler Vocabulary Scale for the job applicant group. 
Human figure drawings of non-defective adults thus do not generally appear to be 
commensurate with level of intellectual functioning in either the verbal or concrete 
performance areas. 

TABLE 1. COMPARISON OF GOODENOUGH SCORES OF THREE NON-PSYCHOTIC GROUPS OF ADULT MALES 

                               Alcoholic Group    Job Applicant    Mentally Deficient 
Factors                        N = 30             Group N = 40     Group N = 30 

Goodenough Draw-A-Man Test 
  Mean IQ                      70                                  66 
  S. D.                        19.6                                18.5 
  r                            .30*               .04**            .78*** 
  Mean MA                      G4                                  8-7 

*indicates correlation coefficient with Wechsler-Bellevue Full Scale IQ. Mean 110; S. D. 9.9. 
**indicates correlation coefficient with Wechsler Vocabulary Scale Weighted Score. Mean 10; S. D. o7. 
***indicates correlation coefficient with Stanford-Binet IQ. Mean 62; S. D. 15.3. 

DISCUSSION 

Goodenough did not attempt to establish validity for the Draw-A-Man Test 
beyond the 10 year chronological age level because it was found that older children 
frequently failed to do themselves justice because of self-consciousness and ability 
to recognize difficulties involved in drawing a man. These attitudes seemed to hold 
true for adult subjects in the alcoholic and job applicant groups who generally ap- 
peared to associate the task with artistic aptitude and therefore expressed anxiety 
as to their ability for adequate performance on the test. The subjects in these groups 
in general engaged in the task rather unwillingly and remarked apologetically about 
their artistic ineptitudes and the quality of their finished productions. The test did 
not appear to constitute a similar threat to the mentally deficient working patients 
who proceeded to draw the figures without comment as to the possibility of creating 
an unfavorable impression with their drawings. It is possible that the drawing scores 
of the alcoholic and job applicant groups were lowered by negative attitudes and 
efforts to complete a disagreeable task as quickly as possible with a resulting paucity 
of scorable details included in their human figure drawings. It is also possible that 
artistic aptitude, a special ability, is a closely related factor affecting human figure 
drawing scores of adults as was verbally suggested by numerous subjects of this 
investigation. In such a case, results of the study would appear to indicate that as 
evaluated by Goodenough norms, drawing ability of adults does not in general de- 
velop beyond a middle childhood level. 


SUMMARY 


This study was designed to test the hypothesis that, as scored by Goodenough 
norms, human figure drawings of both maladjusted and normally adjusted adults of 
average or higher intelligence are so graphically immature as to be comparable to 
those of adult mental defectives. In order to ascertain the relative maturity of human 
figure drawings of three such groups, the Draw-A-Man Test and a formalized intelli- 
gence scale were administered to a group of hospitalized chronic alcoholics, a group of 
job applicants and a group of adjusted adult mental defectives. The formalized in- 
telligence scale results disclosed a range of mental ability of low average to superior 
level in the alcoholic and job applicant groups. The mean IQ of the mentally de- 
ficient group was of high grade defective level. Comparison of drawing performances 
of the three groups resulted in the following conclusions: 


1. Human figure drawings of maladjusted and normally adjusted adults of 
average or higher intelligence are so graphically immature as to be comparable to 
those of adult mental defectives. 


2. The Goodenough Draw-A-Man Test may be considered an adequate 
measure of the general intellectual ability of adjusted mentally deficient adults but 
is not indicative of the intellectual functioning level of non-defective adults, whose 
drawings are not commensurate with verbal and concrete performance abilities as 
measured by other tests. 


3. Drawing performances of non-defective adults may be inhibited by negative 
attitudes toward the drawing task. 


4. Drawing ability of adults lacking in special artistic aptitude may possibly 
not develop beyond a middle childhood level. 
CULTURAL VS. INTROPSYCHIC FACTORS IN CHILDHOOD BEHAVIOR 
PROBLEM REFERRALS 


E. LAKIN PHILLIPS 


Anderson Orthopedic Hospital 
Arlington, Virginia 


PROBLEM AND METHOD 


From sociological and learning theory viewpoints, the question can be raised as 
to whether the referral of behavior problem children for psychological help follows 
any systematic pattern. An intropsychic hypothesis such as the psychoanalytic 
one may be construed to mean that the occurrence of behavior problems stems from 
a non-cultural (i.e., psychosexual) base. This would mean that the probability of 
any child in the family being disturbed is equal to that of any other child, given 
typical psychosexual development. In the interest of testing the psychoanalytic 
hypothesis the files of a public guidance center and those of the author’s private 
practice were studied as to ordinal position and sex of 635 children referred for help. 
The records covered 7½ years from the former source, 4 years from the latter. 


RESULTS 


The findings are tabulated in Table 1. The population studied excluded children 
under age 2 or over 16, and all others admitted for administrative or nonpsychological 
purposes. Only behavior problem cases referred ostensibly for psychological help 
were included. Control data came from the 1950 U.S. Census, and are adequate 
for comparison with the clinical group except that the Census cases cover ages zero to 
17, inclusively, and represent all cultural groups. Fertility rate data show 21.6% 
of the families (base: women, ages 15-59 inclusively) with zero children, 24.1% with 
1 child, 23.5% with 2, 13% with 3, and 17.8% with 4 or more. The first-born incidence 
equals 44.12%, non-first-born 55.88%. Hence, from the frequency of first-born 
in the general population, we would expect about 44% occurrence of the same group in 
the clinical population. Table 1 shows the incidence of first-born in the clinical population 
to be 76.38% (where, as above, first-born includes only children). A test of the 
significance of the difference in proportions shows the critical ratio between these two 
proportions (obtained 76% vs. expected 44%) to be 16.24, with p < .001. Thus the null 
hypothesis which asserts that no difference in the ordinal position exists among 
children referred for psychological help is rejected at a high level of confidence. In the 
same way the intropsychic hypothesis that behavior problems are not primarily due 
to learning-cultural factors is likewise rejected. This conclusion is limited to these 
data and does not apply to other aspects of psychoanalytic theory. 

TABLE 1. ORDINAL POSITION AND SEX DISTRIBUTION AMONG 635 CHILDREN REFERRED FOR 
PSYCHOLOGICAL HELP 

Birth            Boys               Girls              Totals 
Order          N       %          N       %          N        % 

Oldest        239    37.64       108    17.01       347     54.65 
Only           84    13.23        54     8.50       138     21.73 
Other         122    19.21        28     4.41       150     23.62 

Totals        445    70.08       190    29.92       635    100.00 
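The critical ratio quoted above can be approximated with a one-sample test of a proportion. The sketch below assumes that the standard error is taken from the expected proportion; the article does not give its formula, so this is one plausible reading rather than the author's documented method. With the rounded proportions quoted in the text, it yields a value close to the reported 16.24.

```python
import math


def critical_ratio(p_obs, p_exp, n):
    """Normal deviate for an observed proportion against an expected one.

    Assumption: the standard error is computed from the expected proportion.
    """
    se = math.sqrt(p_exp * (1 - p_exp) / n)
    return (p_obs - p_exp) / se


# Rounded proportions quoted in the text: obtained 76%, expected 44%, N = 635.
cr_first_born = critical_ratio(0.76, 0.44, 635)
```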

Looking further at the data we see that the incidence of boys in the clinical 
population is 70%, a difference 10.14 times the standard error of the difference between 
the obtained 70% and the expected 50%. First-born boys occur 72.58% of the 
time, and first-born girls 85.26%, in the clinical population. The likelihood that a 
behavior problem child of either sex is a first-born is extremely great. 


DISCUSSION 


By way of explanation of these data a parent-education hypothesis may be 
extended. First-born are more often viewed as problems and referred for help because 
the parents are inexperienced as parents; they learn on the first-born. Boys 
are referred more often presumably because they are more aggressive and assertive; 
since they are supposed to be more assertive, it is often hard to know where to stop 
them, where to set appropriate limits. Data on only children may show a superficial 
inconsistency with these data. Much research showing only children 
to be more neurotic than non-only ones was derived from matched comparisons 
on psychometric and other devices, whereas the present data are based on cases 
severe enough in the parents’ eyes to warrant referral. The prevalence of only children 
in the general population is 24.1%; in this study, 21.73%, a difference unreliable 
even at the .10 level of confidence (C. R. = 1.33). 

Insofar as referred cases are concerned, it appears that the more general hypothesis 
relating ordinal position to behavior problem incidence is the more useful 
hypothesis. First-born are often seen as problems because the coming of sibs highlights 
the differences between them. Similarly, with the only child, parents have 
few or no reference points: they tend not to refer them for help although tests often 
point up their problems. These data, then, point in the direction of looking at behavior 
problems not in intropsychic terms, but more in cultural and learning 
terms, especially as these factors impinge upon or structure parental perceptions of 
child behavior. Parent education procedures (discussion groups, talks, films) are 
seemingly better focal points for studying, preventing and ameliorating these problems 
than long-term, depth procedures. 


SUMMARY 


From child guidance records, 635 cases were examined as to sex and ordinal 
position. Reliably greater numbers of first-born occurred in the clinical population 
than in the general population. Whether male or female, the chance that a child 
referred for psychological help is a first-born is found to be great. Results were interpreted 
in terms of parent-education explanations with emphasis on the readiness of 
parents to cope with the first-born vs. the non-first-born, rather than trying to explain 
behavior problems on the basis of more psychoanalytic theories. 
A SHORT FORM OF THE WECHSLER-BELLEVUE INTELLIGENCE SCALE 
FORM II FOR A PSYCHOTIC POPULATION 
MARK B. CALDWELL AND JULIAN C. DAVIS 


Florida State University Florida State Hospital 


INTRODUCTION 

Brief and easily administered forms of intelligence tests have many disadvantages 
and limitations in clinical practice. Namely, “they involve the expectation 
that a judgment of intellectual functioning . . . can be reached in a few minutes, . . . 
the discrediting of psychology because of unreliable or invalid results . . . and the 
reduction in the aspects of functioning which can be measured.” Cotzin and 
Gallagher pointed out some of the ways whereby the time saved could be utilized 
in exploring other necessary aspects of the patient’s functioning. 

The Wechsler-Bellevue scales give a reliable measure of adult intelligence and 
by the nature of their construction lend themselves readily to subdivision; 
consequently, the use of various abbreviated forms of these scales for the estimation 
of intellectual capacity has been suggested by several investigators. Three subtests, 
Comprehension, Arithmetic, and Similarities (CAS), were used by Rabin and 
Springer. Correlation coefficients were found by these authors to range from .89 
to .96. Gurvitz, using Digit Span and Picture Arrangement, obtained a coefficient 
of correlation of .90 between these two subtests and the Full Score. The best of these 
abbreviated Wechsler-Bellevue forms appears to be that suggested by Kriegman 
and Hansen. Utilizing the four subtests, Vocabulary, Information, Block Design, 
and Similarities, correlations with the Full Scale of .91 and .94 were obtained. 


PROBLEM 
The purpose of this study is to check the validity of Kriegman and Hansen’s
short form of the Wechsler-Bellevue I in giving an accurate estimate of the Full Scale
IQ in both psychotic and “normal” subjects on the Form II. Should the coefficient
of correlation of the short form of the Wechsler-Bellevue Form II with the Full
Scale prove sufficiently high, the findings take on an added significance, as the
short form then affords a valid brief retest. An unselected psychotic group was used
as the focal point for the investigation, simply because that is the group we are
dealing with in a mental hospital. In our situation, to be useful, a short form must
give an accurate estimate of the Full Scale IQ for the psychotic as well as the
“normal” individual.


METHOD

Four subtests, Vocabulary, Information, Block Design, and Similarities
(hereafter referred to as VIBS I), which Kriegman and Hansen reported as a
“quick, reliable and valid measure of mental ability,” were selected as a short form
test of the Wechsler-Bellevue Form II. The rationale for the selection of these four
tests is based on Wechsler’s notation that they are among the best in the entire
battery, as well as the fact that Kriegman and Hansen report a higher correlation
with the Full Scale IQ’s than did Rabin, Springer, and Gurvitz with their short
forms.

The IQ’s for the VIBS II were obtained by multiplying the total weighted
score of the tests by 2.5, since the full Wechsler equals 10/4 VIBS. The estimate of
the Full IQ was then obtained from Wechsler’s tables. Following the same
procedure, another IQ was calculated from the VIBS II scores using a prorating factor
of 2.7. This empirical factor was suggested by Kriegman and Hansen on the belief
that the theoretical factor of 2.5 consistently underestimated intelligence.
Coefficients of correlation between the VIBS II IQ’s with prorating factors of both 2.5
and 2.7 with the Full Scale Wechsler-Bellevue II IQ were obtained.
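
The prorating step just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name and the weighted scores are hypothetical and are not data from the study, and the final conversion of the prorated score to an IQ through Wechsler’s tables is not reproduced.

```python
# Sketch of the VIBS prorating step. The record below is hypothetical;
# the lookup of IQ from Wechsler's tables is deliberately omitted.

THEORETICAL_FACTOR = 10 / 4  # "the full Wechsler equals 10/4 VIBS" -> 2.5
EMPIRICAL_FACTOR = 2.7       # factor suggested by Kriegman and Hansen


def prorate(weighted_scores, factor=THEORETICAL_FACTOR):
    """Scale the summed weighted subtest scores to a full-scale equivalent."""
    return sum(weighted_scores.values()) * factor


# Hypothetical weighted scores for a single record (illustration only).
record = {
    "Vocabulary": 10,
    "Information": 9,
    "Block Design": 11,
    "Similarities": 10,
}

score_25 = prorate(record)                    # theoretical factor, 2.5
score_27 = prorate(record, EMPIRICAL_FACTOR)  # empirical factor, 2.7
print(round(score_25, 1), round(score_27, 1))
```

Since both factors multiply the same sum, the two prorated scores differ only by a constant ratio; whether the resulting IQ’s differ in their correlation with the Full Scale depends on the table conversion, which this sketch leaves out.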

The present study was based on the data from tests of 85 psychotic individuals
and a control group of 53 tests of “normal” persons. Eighty-five Wechsler-Bellevue
Form II records were selected from the patient files at the Florida State Hospital.
The sampling of cases was on a random basis with no attempt being made to select
them in regard to age, sex, mental diagnosis, or IQ. The only requisite was that all
the subtests be administered. In the same random manner 53 normal test records of
the Wechsler-Bellevue Form II were taken from student files at Florida State
University augmented by a few Florida State Hospital student nurse protocols.


RESULTS 
The psychotic group age range was 15 to 50 years with the median at 27 years. 
The age range of the “normal” group was 17 to 27 years with the median at 20 years.
The psychotic group was about equally divided as to sex while the normal group was
predominantly female.


TABLE 1. IQ RANGE, MEDIAN IQ AND CORRELATIONS ON EACH GROUP OF THE VIBS II

                       IQ Range                       Mean IQ               r with W-B II
Groups        W-B II   VIBS II   VIBS II     W-B II   VIBS II   VIBS II     VIBS II   VIBS II
                       (2.5)     (2.7)                (2.5)     (2.7)       (2.5)     (2.7)

Psychotics    45-127   44-146    46-146      77       79        82          .94       .93
(N = 85)

Normals       91-139   85-146    92-146      120      123       130         .91       .91
(N = 53)





Table 1 summarizes the characteristics of two groups of cases. Here it is noted
that the widest variation between the Full Wechsler-Bellevue II IQ’s and the VIBS
II IQ’s is at the upper levels, that is, there is a tendency for the VIBS II to
overestimate the intelligence of the superior individual. Undoubtedly, it is this tendency
which accounts for the slight difference in the correlation between the VIBS II and
the Full Scale Wechsler-Bellevue II IQ’s on the psychotic and normal groups. A
slightly higher correlation is obtained in the psychotic group because fewer of its
members are functioning at a superior level.

There is no evidence of significant advantage in using Kriegman and Hansen’s 
empirical prorating factor (2.7) over the theoretical factor (2.5). 


SUMMARY 


Previously proposed abbreviated forms of the Wechsler-Bellevue Form I,
which have been standardized on various groups, were considered for application to
Form II, not only as a check on the original thesis regarding the validity of the 
short form test, but also to furnish a retest. The subtests, Vocabulary, Information, 
Block Design, and Similarities, were chosen on the basis of Wechsler’s data and the 
work of Kriegman and Hansen. On the basis of results obtained from 85 unselected 
psychotic protocols and an unselected group of 53 normal students we conclude: 


1. VIBS Form II is as valid an indication of Full Scale Form II IQ as
its Form I counterpart.

2. VIBS II is a valid estimation of the Full IQ in individuals of no better than
bright normal intelligence. IQ’s of superior individuals are overestimated.

3. We agree with Kriegman and Hansen that “when a brief qualitative
estimation of mental ability is desired the VIBS is a valid . . . test.”

PROFESSIONAL INSECURITY: AN OCCUPATIONAL NEUROSIS OF
CLINICAL PSYCHOLOGISTS


The current situation of the profession of clinical psychology contains many 
factors conducive to anxiety reactions which may assume disabling proportions. 
Clinical psychology is a young profession interacting and competing directly with 
older and more powerful disciplines. Many clinical psychologists feel a compulsion 
to acquire scientific respectability by identifying themselves with better established 
psychiatric authorities and viewpoints, and hence have borrowed for their own use 
many techniques from borderline fields. Clinical psychology itself has developed so 
rapidly that many of its techniques are unvalidated with the result that there is 
serious doubt of the validity and reliability of many current practices. The situation 
is complicated by a difficult problem of inter-professional relationships with psy- 
chiatry, with many clinical psychologists finding themselves working as ancillary 
technicians under the supervision and responsibility of physicians, and with those
who are working independently having constantly to justify their status in the
healing arts. Finally, public demand for psychological services has enlarged faster than
clinical psychologists can develop and acquire the necessary skill and competency.
Clinical psychologists constantly find themselves confronted with problems and
decisions which are actually beyond their resources with the result that they must
admit incompetence or else blunder along over their heads.

Professional insecurity stimulates many types of defensive reactions manifested 
both in clinical practice situations and also in all types of professional and inter- 
professional relationships where ego status is at stake. In letters to the editors, 
several of our readers have pointed out that in dealing with alleged infractions of 
professional ethics we sometimes seem to be leaning over backwards in meting out 
over-severe punishments to violators. Some of our colleagues show undue sensitivity 
when the validity of their practices is questioned although this is completely
understandable since their whole professional competence would disappear if their
techniques were proven invalid. Others reveal insecurity in the intensity with which they
identify themselves with authorities such as Freud, Adler or Jung, or with current 
psychiatric leaders. Even at professional conventions and scientific meetings, clinical 
psychologists seem to us to manifest a great deal of insecurity in the forced spon- 
taneity and affectedness of their professional role-playing over and beyond that dis- 
played at conventions of other professions. One gets the impression that many of us 
are frightened and insecure behind a mask of stereotyped role-playing behavior. 

What is the antidote for insecurity? Security cannot be achieved by propping 
up a house of cards with makeshift devices and defensive mechanisms. It can only be 
gained by building up foundations of basic science knowledge and valid clinical 
practices so that the clinician feels secure in his competence and supported by the 
validity of his predictions and outcomes. In other words, clinical psychologists will 
feel secure in direct proportion to the validity of their underlying premises and prac- 
tices. In the light of all this, some current attitudes of clinical psychologists to basic 
problems of diagnosis seem paradoxical. A survey of the annual reviews of research 
in clinical psychology conducted by this Journal in the last five years (1951-1955) 
reveals a relative lack of interest in basic problems of diagnosis. For example, only 
once in the last five years did the area of validity studies of psychiatric diagnosis 
rank among the first ten in incidence, while other important areas of diagnostic 
study were virtually ignored. In contrast, objective studies of the outcomes of psy- 
chotherapy are becoming increasingly popular even in cases where authors virtually 
say that they are not interested in the diagnoses of their subjects, or to find out how 
they got that way, but only in how well their favorite method of therapy works! We 
find powerful schools of therapy and prominent leaders in the field virtually ignoring 
problems of diagnosis on which all validity studies must be based, and instead
defending themselves with snide comments to the effect that an interest in diagnosis
implies “authoritarianism”, “directiveness” or “management” rather than
psychotherapy, the imposition of preconceived diagnostic rubrics onto dynamics which are
essentially individual, and that to diagnose another person is ‘undemocratic’ or even 
unethical because it infringes on his right to be himself and not to be tampered with, 
etc. In some localities of clinical psychological practice, the person interested in 
diagnosis as a basic issue is even regarded as a freak whose efforts are ignored or dis- 
paraged. All this is considered paradoxical because such attitudes seem to be de- 
riding the very skills upon which all clinical psychology must depend for its validity. 
The foundation of all valid practice in clinical psychology must be valid diagnosis 
because it is the one distinctive contribution which we have to make. Professional 
psychologists pride themselves primarily on their basic science knowledge
and skills in applying scientific statistical-experimental methods to clinical problems. 
But psychologists do not have any monopoly in these areas, and any advantage they 
currently have will be lost quickly when other professions master the same areas. 
Psychologists actually are at a great disadvantage in the field of psychotherapy 
where, other things being equal, psychiatrists are better trained. The one field in 
which clinical psychologists do have an advantage is in psychological measurement 
and diagnosis and, in fact, these are the skills most demanded by other professions 
and the general public. Professional insecurity may be expected to continue as long 
as rationalizations and defensive mechanisms are utilized to avoid coming to grips 
with basic problems of diagnosis. Diagnosis is the basic foundation of clinical 
science, and methods which disregard it are not only unscientific but productive of 
insecurity and defensiveness. 
F. C. T. 

Derived experimentally by James W. Howard, Ph.D.

An individual test consisting of 12 plates 8½” x 11”. The blots are
somewhat larger than the Rorschach and with a wider range of
stimulating features. Blots of varied colors selected from an
experimental sample of 10,000 blots.

Not intended to be a parallel test to the Rorschach. Statistical
findings from a “normal” group show the test to differ markedly from
the Rorschach in ways which should indicate greater sensitivity.
Selection and arrangement of blots made on the basis of evidence of 
character, degree and spread of stimulation. 

Data from normative sample indicate more frequent light deter- 
mined responses, higher incidence of color responses with different 
C, CF and FC relationships, more movement responses and fewer 
animal responses. Greater diagnostic sensitivity with wider range 
of response. 

Grinker, R. R. (Ed.) Toward a Unified Theory of Human Behavior. New York:
Basic Books, 1956, pp. 375. $6.50 


This book summarizes the high points of four conferences of leading psychologi- 
cal scientists which were privately organized by Dr. Grinker at Michael Reese Hos- 
pital to explore a unified theory of behavior. Dr. Grinker contributes sections on the 
intrapersonal organization, comparisons between systems of organizations and a 
summary; John P. Spiegel on a model for relationships among systems, and com- 
parison of psychological and group foci; David Shakow on the psychological system 
and comparison of psychological and group foci; Jurgen Ruesch on human com- 
munication theory, autonomy and boundaries according to communications theory, 
and the analysis of various types of boundaries; Talcott Parsons on the social system, 
the relation between the small group and the larger social system, and boundary 
relations between sociocultural and personality systems; Laura Thompson on the 
societal system, culture and the community; Florence Kluckhohn on value orienta- 
tions; Jules Henry on a system of socio-psychiatric invariants and homeostasis in a 
special life situation; James E. P. Toman on multiple origins of the uniqueness of 
human society and stability vs. adaptation; Alfred E. Emerson on homeostasis and 
comparison of systems; Lawrence K. Frank on social systems and culture; Anatol
Rapoport on homeostasis, and statistical boundaries, and Karl Deutsch on autonomy
and boundaries according to communications theory. These high level discussions 
provide very stimulating reading for pioneering in multi-discipline approaches to 
behavior study. 


Stern, G. G., Stein, M. I. and Bloom, B. S. Methods in Personality Assessment.
Glencoe, Ill.: Free Press, 1956, pp. 271. $6.00 


This book describes four methods of assessment methodology—analytical, 
empirical, synthetic and configurational—for the study of individual subjects in re- 
lation to the environments with which they are to interact, and also to compare the 
results of the analysis of individuals with the characteristics of hypothetical models. 
The application of these approaches is presented in relation to studies of theological
students, teacher-trainees, graduate students and college freshmen. Among other
methods used, special attention is given to the results obtained from the Activities 
Index, the Inventory of Beliefs, the Physiognomic Cue, and Drawings. Three syn- 
dromes were discovered which are labelled S (Stereotypy), N (Non-Stereotypy) and 
R (Rational). The Stereopath is comparable to the authoritarian personality estab- 
lished in other studies and is almost antipodal to the Non-Stereopath. The syn- 
dromes are differentiated dynamically in terms of reactions to others, coping mech- 
anisms, impulse acceptance and control, energy level, autonomous-homonomous 
balance, self-maintenance, and organization-integration. The authors have suc- 
ceeded well in enlarging the scope of the factors which must be measured in any 
comprehensive assessment program. 


Wallen, R. W. Clinical Psychology. New York: McGraw-Hill, 1956, pp. 388. $6.00


The author is associate professor of psychology at Western Reserve University. 
This is an elementary textbook, sound but rather uninspired, which gives a good 
operational account of what clinical psychologists do in a series of chapters describ- 
ing general aims, methods of prediction, nature of clinical judgment, interview and 
observational methods, medical assessment, projective tests (Rorschach and TAT), 
psychometrics, and the nature of psychotherapy. 

Weinberg, H. and Hire, A. W. Case Book in Abnormal Psychology. New York:
Knopf, 1956, pp. 320. $4.00 
Twenty case histories are presented largely in historical-descriptive styles for 
use in introductory classes. Should prove valuable for class discussion purposes. 


Stevenson, G. S. Mental Health Planning for Social Action. New York:
McGraw-Hill, 1956, pp. 358. $6.50
Dr. Stevenson has an established reputation as one of the foremost mental
health authorities in the world. This book discusses the mental health movement
operating to restore, protect and elevate mental health. Typical community
resources are described with a positive program for how each may be used most
constructively.

Sartre, J-P. Being and Nothingness. New York: Philosophical Library, 1956, pp. 
638. $10.00 
This essay on phenomenological ontology is the source work on existentialism.
Excellently translated by Hazel E. Barnes, it includes a welcome vocabulary of
special terms.

McCartney, J. L. Understanding Human Behavior. New York: Vantage Press, 
1956, pp. 258. $3.50. Essays for the layman to help them understand psy- 
chiatry. 

Cohen, J. and Hansel, M. Risk and Gambling. New York: Philosophical Library,
1956, pp. 153. $3.50

Loewy, Herta. Training the Backward Child. New York: Philosophical Library,
1956, pp. 165. $3.75.

Hitschmann, E. Great Men: Psychoanalytic Studies. New York: International
Universities Press, 1956, pp. 278. $4.00

Wheatley, G. M. and Hallock, G. T. Health Observation of School Children.
New York: McGraw-Hill, 1956, pp. 488. $6.50. Second edition.

Johnson, W., Brown, S. F., Curtis, J. F., Edney, C. W. and Keaster, J. Speech
Handicapped School Children. New York: Harper, 1956, pp. 575. $4.50

Housden, L. The Prevention of Cruelty to Children. New York: Philosophical
Library, 1956, pp. 406. $7.50

Rolin, J. Police Drugs. New York: Philosophical Library, 1956, pp. 194. $4.75








SPECIAL TEST REVIEW 





THE CATTELL 16 PERSONALITY FACTOR QUESTIONNAIRE 


The 16 Personality Factor Questionnaire by R. B. Cattell and G. F. Stice, Forms A,
B and C. Handbook and scoring keys 1950. The Institute for Personality and 
Ability Testing, 1602 Coronado Drive, Champaign, Illinois. 


Whereas the relative freedom for experimentation on the part of the academic 
psychologist may make it possible for him to give all kinds of tests to a subject, this 
is not the case with a busy institutional or practicing psychologist. The truly ap- 
plied psychologist must use diagnostic tools that pay off accurately and quickly. His 
tests must give him objectively derived insights which he cannot develop so surely 
without the tests in the comparatively short time at his disposal. In addition to the 
usual demands of scientific accuracy embodied in the concepts of reliability and 
validity, the applied psychologist must use tests which are useful in a practical way. 
It appears to this reviewer that Cattell’s 16 P. F. Test is the best test of personality 
thus far developed to meet the stringent requirements of applied and clinical psy- 
chologists for accuracy, usefulness, and brevity. 

Despite the fact that the clinical psychologist interacts with his patient on a 
verbal level by the exchange of questions and impressions, he has been reluctant to 
use questionnaires. Instead, he has favored the esoteric projective devices with their 
apparent sophistication and somewhat more obvious limited reliability and validity. 
With growing evidence of the low reliability and validity of the Rorschach and other 
projective tests, and with the success of the M. M. P. I., it is no surprise that there is
a new research emphasis on questionnaire tests. This new interest in questionnaire 
tests also stems from the fact that they can be based on sound research rather than 
upon the arbitrary categories employed in the some fifty questionnaires used during 
the past quarter of a century. The new questionnaires are essentially those of
Guilford-Zimmerman, Thurstone, and Cattell. It is with the last of these that this
review is concerned.

A tremendous amount of theoretical and practical research underlies Cattell’s 
16 P. F. Test. The questionnaire factors are built upon factors first found in behavior 
ratings employing the broad “personality sphere concept.” Thus all the important
dimensions of personality are caught in this test, yet its use is so simple as to make it 
available to psychologists with only a minimum of specific orientation to this test. 
It has been constructed by the time consuming process of determining the factor 
structure of each item instead of the easier technique of factorizing whole blocks of 
questions. Furthermore, there is no overlap of questionnaire items among the 
different factors which eliminates a principal source of markedly spurious correla- 
tions. Because of the individual item factorization and because of the lack of overlap 
in questionnaire items among the different factors, this test appears to the reviewer 
to be methodologically superior to all other questionnaire tests. 

To quote its author, “This test is designed to give the maximum information in
the shortest time about the most dimensions of personality.” Now this test, which
takes less than an hour to administer and score, provides the clinician with the
maximum number of identifiable source traits of personality. These source traits are not
arbitrarily decided upon but are the result of many laborious factor analyses. These
are the mathematically isolated and clinically described minimum factors necessary 
to account for the variance of ratings in real life behavior situations, objective tests, 
and clinical and social performances. 

The 16 P. F. Test, as would be guessed by its name, identifies and measures 
sixteen basic traits. Fifteen of these traits are dynamic source traits of a
temperamental nature. One of them (Factor B) measures practical intelligence which
corresponds to the well known “g” factor of general mental capacity as this factor
affects personality. Naturally this and the other factor measures cannot be as
accurate as, for example, is a single hundred item intelligence battery. However, for
the busy clinician it is a distinct advantage to have all sixteen of these measurements 
available for observation in less than an hour’s time. 

The Cattell 16 P. F. Test comes in three forms. Forms A and B require a lan- 
guage facility equivalent to that used in everyday newspapers. These two forms 
require a minimum of supervision and can be given and scored all within an hour 
even allowing for the slow reading time on the part of some taking the test. Form 
C of the test, which is the latest form published, is for industrial use and is both 
shorter and uses simpler language than Forms A and B. All the tests can be machine 
or hand scored. From a mechanical point of view, the test may be scored by hand 
on the test booklet, it may be scored by hand on the machine answer sheet, or it may 
be machine scored. For quantities of less than 500, it is not felt worth while to 
employ machine scoring. These scores are raw scores and adequate tables are given 
to permit their conversion to standard or decile scores. 

The reliability of the factor measurements, as determined by equivalence cor- 
relation between the measures on the different forms, varies from .4 to .6. While 
these are not high in the conventional sense of reliability as used by psychologists, 
they are probably higher than the reliabilities of many of the instruments which 
clinicians are using confidently, and they are certainly as high as one can expect from a
factor measure reduced to ten items in length. In fact, it becomes clear from the 
research literature that a great deal of research is necessary to locate ten items which 
taken together will yield this much saturation in a given factor. Two further points 
on the reliability of this test seem worth while noting. In the first place, if one wanted 
more reliability than is obtained on a single form of the test (a desire which might 
be motivated more by research aims than by practical clinical needs) one would 
need merely to use two forms. Inasmuch as brevity is one of the big virtues of this 
test, doubling the testing time by using both forms is not thought universally useful 
by this reviewer even though it improves the test’s reliability. 
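
The gain from pooling two forms can be estimated with the Spearman-Brown formula. The review itself names no formula, so its use here is an assumption, and it holds only to the extent that the forms are parallel. The sketch below, with a hypothetical function name, applies it to the reported equivalence correlations of .4 and .6.

```python
# Sketch: Spearman-Brown estimate of reliability when a test is lengthened.
# Assumes the added form is parallel to the first; this formula is not
# taken from the review, which only says that two forms improve reliability.


def spearman_brown(r_single, k=2.0):
    """Estimated reliability of a test lengthened k-fold.

    r_single: reliability of one form (here, the equivalence correlation)
    k:        factor by which test length is increased (2 = both forms)
    """
    if not 0.0 <= r_single <= 1.0:
        raise ValueError("r_single must lie between 0 and 1")
    if k <= 0:
        raise ValueError("k must be positive")
    return k * r_single / (1.0 + (k - 1.0) * r_single)


# Reported range of single-form equivalence correlations.
for r in (0.4, 0.6):
    print(r, round(spearman_brown(r), 2))
```

Under these assumptions, doubling the length raises a single-form reliability of .4 to about .57 and one of .6 to .75, which gives a rough measure of what the doubled testing time buys.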

Certain mechanical features of the test recommend its use. To begin with, 
Form C has a motivational distortion scale which determines what degree of “faking”
the subject may be attempting in his answers. In addition, there is a questionnaire 
for children from the ages of 10 to 16 years which can be used to get at genetic per- 
sonality factors when time permits the use of the instrument over a number of years. 
Besides, the number of useful questionnaire personality tests for children is very 
small. Finally, the printing of the test itself is easy to read so that the amount of 
fatigue and subsequent distortion of motivation are held to a minimum. 

The factors which are measured by this test are measured in terms of extremes 
of possession of each trait. In other words, the test does not go from zero to a plus 
or zero to a minus value on any trait but instead permits a free fluctuation of per- 
formance from minus to plus through zero or from one extreme to another on the 
same trait. The factors measured by the test are as follows: 

Factor A: Cyclothymia Versus Schizothymia
Factor B: General Intelligence Versus Mental Defect
Factor C: Emotional Stability or Ego Strength Versus Dissatisfied Emotionality
Factor E: Dominance or Ascendance Versus Submission
Factor F: Surgency Versus Desurgency, or Depressive Anxiety
Factor G: Character or Super-Ego Strength Versus Lack of Internal Standards
Factor H: Adventurous Autonomic Resilience Versus Inherent, Withdrawn Schizothymia
Factor I: Emotional Sensitivity Versus Tough Maturity
Factor L: Paranoid Schizothymia Versus Trustful Altruism
Factor M: Hysteric Unconcern (or “Bohemianism”) Versus Practical Concernedness
Factor N: Sophistication Versus Rough Simplicity
Factor O: Anxious Insecurity Versus Placid Self Confidence
Factor Q1: Radicalism Versus Conservatism
Factor Q2: Independent Self-Sufficiency Versus Lack of Resolution
Factor Q3: Will Control and Character Stability
Factor Q4: Nervous Tension

Now it should be clearly stated that these sixteen factors were not first
arbitrarily named to conform to some clinical hypothesis or persuasion and then
constructed into tests. Actually the traits were found by repeated factor analyses as
unquestionable unitary or source traits. After these traits were established beyond 
any mathematical doubt, they were named according to the content of the items 
in a factor and this factor’s correlations with various clinical and industrial criteria. 
While some of the trait descriptions may appear strange to psychologists not ac- 
quainted with Cattell’s writing, the authority of the trait’s existence cannot be 
questioned on the mere assumption of a distaste for its label. Of course, further 
clinical use of this test will doubtless refine the trait descriptions but it is difficult to
see how much improvement can be made on the mathematical isolation of the traits 
from the original tests, observations, and behavior studies which made up the 16 
P. F. Test. 

The question will naturally arise as to whether the 16 P. F. Test can add enough
to the clinician’s insights to justify its use in the limited time at his disposal. To be 
sure, the 16 P. F. Test does not deal with problems of a dynamic conflict peculiar to 
any particular patient. The factor personality tests make no effort to handle the 
unique emotional problems of a given patient. However, it will be remembered that 
the eminent physician Sir William Osler once wrote, “It is as important to know
what patient has the disease, as to know what disease the patient has.” Now in this
reviewer’s opinion, based upon experience in his own practice, the 16 P. F. Test
gives more meaningful information about a given patient in the short time involved
in its administration and scoring, than does any other test. Where there is time at
hand, this test can and should be supplemented with tests of ability and the more 
dynamic projective tests, but where only one personality test can be used, it is the 
reviewer's test of choice. 

When a patient is to be seen over a fairly long period of time for purposes of 
psychotherapy, the clinician needs to know something of the inner resources of the 
patient. The clinician needs to know something about the patient’s resources of 
personality for handling conflict and, of course, for handling psychotherapy. It 
is necessary to know something of the frustration-tolerance level of the patient, what 
his super-ego strength is, and whether he is likely to move into a schizoid form of 
adjustment to his situation. The 16 P. F. Test can supply this information since the 
levels of ego strength, super-ego strength, free floating anxiety, etc., have been stand- 
ardized against the general population and permit in a glance a picture of where the 
patient stands. 

The 16 P. F. Test will reveal more reliably than the interview, as Kelley and
others have shown, how the patient stands in his general personality in relation to 
the average of the population. This test yields significant clues as to the kind of 
temperament a given patient has and to the strains and stresses which he is able to 
bear in the analytical process. One of the outstanding contributions of this test is 
the extent to which it measures the presence of neurosis in a patient. The test pro- 
files show that neurotics differ from the general population significantly in four 
factors, viz. factors C (ego weakness), I (anxiety 1), O (anxiety 2), and Q4
(psychosomatic anxiety). Skillfully interpreted, this test could provide a clinician with a lot
of data that would assist him in mapping out a therapeutic program for a
patient.

This test provides the guidance worker with data of a more positive statistical 
character than are available to the clinical psychologist at this time though there are 
data presented for schizophrenics, manic-depressives, neurotics, psychopaths, and 
convicts. In the handbook, there are over twenty varied occupations represented by 
test profiles against which comparison of a given individual’s scores can be made 
readily from an included nomograph in terms of pattern correlation coefficients. In 
some cases the correlations of individual factors with success, as measured by occupa- 
tional criteria, are also available. For example, a weighting formula has been pro- 
vided from a study of a thousand cases for picking the individuals who will stand out 
as leaders and successful supervisors. This formula involves positive loadings on 
ego strength, surgency, and super-ego strength, and negative loadings on anxiety 2 
and anxiety 3. It is curious that dominance, as reflected in this test, seems to play 
only a minor role in determining successful leadership. It is also of interest to note 
that surgency predicts the likelihood of being elected to a role of leader better than 
it does the likelihood of being an effective leader. These and many other practical 
relationships are worked out in the handbook. In fact, the interesting possible 
interrelationships of the various scoring patterns with different criteria make the 
possible applications of this test loom quite large and inviting to the research-minded 
applied psychologist. 

One study of special interest to psychologists is reported in the handbook. It 
involves an investigation of successful and unsuccessful psychiatric technicians 
studied with the 16 P. F. Test. This study shows that success can be predicted from 
a positive loading of ego strength and of super-ego strength, together with a negative 
loading in the radicalism factor. Perhaps the strong radical feels the world is so 
bereft of its senses that he cannot put up with further contacts with unstable in- 
dividuals in his daily life, or perhaps his inability to accept leadership rule would 
make him a poor member of any medical team. These and other interesting studies 
are reported by Cattell in the handbook. Besides, an extensive bibliography of the 
pertinent research on the test and the factors is included. This test is not the result 
of some arbitrary opinion put together in a casual way with mathematical packaging 
to make it look respectable. It is the culmination of the first stage of an extensive 
research program and it is to be expected that further refinements will increase its 
usefulness. 

At the present time, the most serious limitation of its usefulness is the small 
sample included in the occupational and clinical profiles published in the handbook. 
It is the reviewer’s opinion that while these profiles are quite suggestive, larger 
samples are needed before the profiles can be leaned upon heavily in counseling. But 
this is a kind of research that is underway and will doubtless be reported from time 
to time. Another place where the reviewer feels the data could be more helpful to 
the person using the test is in the converted scores. The reviewer feels that a 
ten-point scale is a little too crude to permit the profile sensitivity needed for some 
clinical assessment. But this is a kind of refinement that may appear to be more valuable 
than it really is. At least one of the most attractive features of the 16 P. F. Test, be- 
sides its pragmatic value to the applied psychologist, is in the broad vista of research 
possibilities that can reward the applied psychologists whose opportunity for re- 
search is usually quite limited. 

An objection may be raised about the strange language of the test. 
Psychologists may question the terms “surgency” or “Bohemian unconcern.” Of course, 
these are only names and if better names are available, then they should be used. 
But with continued research showing that psychological sense can be found in the 
factors, however strange they may initially be to the person unaccustomed to the 
nomenclature, the psychologist should get to know these factors, their mode of 
action, and their natural history. It is the reviewer’s opinion that Cattell’s 16 P. F. 
Test will become more and more a standard part of the psychologist’s armamentarium 
which will be a justified endorsement of a superior product. 


Robert P. Fischer, Ph.D. 
Marietta, Ohio 